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RECEPTOR SPECIFIC BACTERIAL ADHESINS AND THEIR USE 
FIELD OF INVENTION 

The present invention pertains to naturally occurring bacte- 
rial adhesins and derivatives and variants hereof, having the 
5 ability to bind to pre -determined, specifically selected 
receptors, and to the use of such adhesins in the targeting 
of active compounds and microbial cells to locations compris- 
ing such selected receptors* 



This invention was supported in part by the US National 
10 Institute of Health (NIH) , under grant #DE07218, and the US 
Veterans Administration. The US government has certain rights 
ih the invention. 



TECHNICAL BACKGROUND AND PRIOR ART 

The ability to adhere or bind specifically to, and in many 
15 instances, to colonize an animate or inanimate surface is of 
paramount importance in microbial ecology and pathogenesis. 
Such specific receptor binding is provided by microbial 
adhesins which play a key role in bacterial /host and 
viral/host recognition and interaction and for the recogni- 
20 tion of any specific surface by a microorganism. 

Accordingly, adhesion of bacteria to host surfaces is common- 
ly regarded as an essential step enabling bacteria to become 
established as members of the normal flora of host organisms 
or to cause an infection (refs. 7, 18). Bacterial lectins are 

25 the most common and most thoroughly studied type of adhesins 
among both gram negative and gram positive bacteria (ref . 
40) . Evolutionary pressures have selected lectins for adhes- 
ive functions probably due to the abundance of glycoconjugat- 
es on animate and inanimate surfaces. One class of structures 

30 that a large range of gram-positive and gram-positive 

bacteria including Escherichia coll and other members of the 
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family Enterobacteriaceae, have evolved to adhere to host 
glycoproteins in a saccharide -dependent manner are surface 
fibrils called fimbriae (ref. 14) or pili (ref. 10). Coloniz- 
ation Factor Antigen (CFA) type I and Colonization Factor 
5 Antigen (CFA) type II are specific examples of such fimbriae. 

By far the most common of the enterobacterial fimbriae is 
type l, or mannose- specif ic (MS) fimbriae (refs. 11, 13, 14, 
23 ) . Type l fimbriae are heteropolymers of four different 
10 subunits (refs. 28, 44). For each fimbria, about 1000 copies 
of a 17-kDa primary structural subunit designated FimA (or 
PilA) , are polymerized into a right-handed helix surrounding 
a hollow axial core (ref. 11). Three ancillary subunits, 
FimF, FimG and FimH, are also polymerized into the fimbrial 
structure, but comprise only 1-3% of the fimbrial mass (refs. 
20, 24, 27, 32) . 



15 



The 28 kDa FimH subunit has been shown by several direct and 
indirect tests to be the actual fimbrial lectin (refs. 2, 4, 
20, 21, 27, 29, 32, 36, 55), although its function may be 

20 affected by other subunits (ref. 55). The FimA subunit is 
highly variable, but the FimH subunit is highly conserved 
antigenically and genetically among enterobacteria (ref. 1) . 
Interactions between type 1 fimbriae and D- mannose -containing 
receptors have been shown in a number of studies to play a 

25 key role in the infectious process (refs. 2, 4, 9, 19, 25, 
26, 31, 33, 44, 50) . 

Detailed analysis of adhesion- inhibition or agglutination- 
inhibition by various mannosides and manno- oligosaccharides 
have suggested that the combining site of the type 1 adhesin 
is in the form of an extended pocket corresponding to the 
size of a trisaccharide and fitting best the structure a-D- 
Manp-(i-3)-0-D-Manp-(l-4)-D-GlcNac (ref. 16). A hydrophobic 
region within or close to the combining site was also pre- 
dicted in these studies. A similar pattern of specificity was 
found independently in indirect adhesion- inhibition studies, 
as well as in direct adhesion studies using "neoglycolipids" 



30 



35 
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as receptors (refs. 37, 47). The combining site of the Kleb- 
siella pneumoniae type 1 adhesin was shown to be similar to 
the Escherichia coli adhesin, whereas the Salmonella typhimu- 
rium type 1 adhesin combining site appears to be smaller and 
devoid of a hydrophobic region (ref . 16) Thus, it has long 
been thought that type 1 fimbriae of enterobacteria were 
functionally quite similar and that the primary essential 
characteristic of any potential receptor was the presence of 
terminal al- 3 -linked mannosyl residues. 

Recently it has been reported that the type l fimbriated, K- 
12-derived E. coli strain CSH-50 exhibits mannose- sensitive 
peptide-binding activity (ref. 51). CSH-50 E. coli bound to 
yeast mannan (Mn) , a highly mannosylated glycoprotein, and to 
human plasma fibronectin (Pn) when immobilized on assay 
wells. Adhesion to Mn, but not to Fn, was essentially elimin- 
ated by periodate treatment. Furthermore, CSH-50 E. coli 
adhered in a mannose -sensitive fashion to non- glycosylated 
peptide fragments of Fn and to a synthetic peptide copying 
the first 30 residues of the Fn molecule, FnSpl. Fimbriae 
purified from these organisms also bound to Fn and FnSpl a 
well -characterized recombinant strain of E. coli PC31 expres- 
sing type 1 fimbriae, HB101 (pPKL4) , adhered to Mn, but did 
not adhere to the other substrata. Fimbriae purified from 
HB101 (pPKL4) did not adhere to Fn or FnSpl. Thus, E. coli 
type 1 fimbriae appeared to be functionally heterogeneous. 

Several E. coli isolates obtained from human urine also 
expressed peptide-binding activity similar to that of CSH-50 
indicating that this new phenotype was not restricted to a 
laboratory strain. Other isolates expressed an adhesive 
activity similar to that of HBl01(pPKL4) . A third class of 
type 1 f imbriae-mediated adhesive phenotype was also observed 
among these isolates. 

The FimH subunit is the D-mannose- sensitive adhesin of type 1 
fimbriae, common i.a. to the Enterobacteriaceae. It is pres- 
ently widely accepted that host receptors are strictly 
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limited to glycoproteins containing terminal mannosyl resi- 
dues (refs. 16, 37, 41, 42, 43, 47). Hereinbelow functional 
and genetic evidence is provided demonstrating that this 
generalization is not correct. Allelic variants of E. coli 
fimH genes encoding proteins differing by as little as a 
single amino acid substitution confer distinct adhesive 
phenotypes and accordingly, the fimH gene is not a single 
gene but rather a family of fimH genes . 

Surprisingly, active receptors for FimH proteins were found 
to include glycoprotein domains where mannosyl residues are 
not terminal and even protein domains devoid of saccharide. 
This unexpected adhesive diversity within the fimH family 
broadens the scope of potential receptors for bacterial 
adhesion and may lead to a fundamental change in the under- 
standing of the role(s) type l fimbriae and other bacterial 
adhesins may play in bacterial ecology or pathogenesis. 

The present findings also opens up a completely new field of 
technology, since it provides the means to design bacteria 
expressing adhesins that bind to pre -determined, specific 
receptors in a wide range of animate and inanimate locations. 
This new technology is referred to herein as Designer Adhesin 
Technology. 



30 



SUMMARY OF THE INVENTION 



Accordingly, the present invention relates in one aspect to a 
25 recombinant or mutant bacterial adhesin variant derived from 
a naturally occurring adhesin, said adhesin variant having 
altered binding properties relative to the naturally occur- 
ring adhesin from which it is derived. 



In further aspects the invention provides a FimH adhesin 
having an amino acid sequence which differs from the E. coli 
PC31 FimH adhesin by at least one amino acid and a recombi- 
nant replicon comprising a DNA sequence selected from the 
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group consisting of a sequence coding for a recombinant 
bacterial adhesin variant as defined above and a sequence 
coding for a FimH adhesin as also defined above. 

In a still further aspect, there is provided a fusion protein 
comprising an adhesin selected from the group consisting of a 
recombinant bacterial adhesin variant as defined above and a 
FimH adhesin as also defined above, and a heterologous 
polypeptide . 



The invention also pertains to a recombinant bacterial cell 
10 which expresses an adhesin selected from the group consisting 
of a recombinant bacterial adhesin variant as defined above 
and a FimH adhesin as defined above, and to a composition 
comprising a population of such cells. 



15 



20 



In one interesting aspect of the invention there is provided 
a method of isolating a bacterial cell expressing an adhesin 
having modified binding properties relative to a natively 
expressed adhesin, comprising identifying in the bacterial 
cell DNA sequence (s) coding for the binding domain (s) of said 
natively expressed adhesin and substituting at least one 
codon herein, whereby a modified adhesin molecule is 
expressed that is different in at least one amino acid from 
the adhesin expressed natively, and selecting a bacterial 
cell expressing the modified adhesin having an altered 
adhesion phenotype relative the natively expressed bacterial 
25 adhesin. 

In a further interesting aspect the invention relates to a 
method of preparing a recombinant bacterial cell that binds 
to a specific receptor moiety, comprising introducing into a 
bacterium that does not produce an adhesin binding to said 
30 receptor moiety, a DNA sequence coding for an adhesin binding 
to the receptor moiety, and selecting a bacterial cell ex- 
pressing the DNA sequence. 
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There is also provided a method of targeting a bacterial 
adhesin to a specific location, comprising (i) identifying in 
said location an adhesin- interacting receptor moiety which is 
recognizable by bacterial adhesins, said moiety preferably 
being one which is occurring abundantly, (ii) isolating a 
bacterial cell that grows in said location and expresses an 
adhesin recognizing and interacting with said receptor moie- 
ty, and administering to the location the bacterial cell or 
the adhesin under conditions where the adhesin and the 
receptor moiety are brought into interacting contact whereby 
the adhesin is associated with the receptor moiety. 
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DETAILED DISCLOSURE OF THE INVENTION 

As used herein the term "bacterial adhesins- denotes proteins 
whxch recognize and bind to a large variety of target mole- 
cules such as polysaccharides, glycolipids, glycoproteins, 
polypeptides and proteins. More than a hundred different 
adhesins have been described so far originating from a large 
variety of gram-negative and gram-positive bacteria. Adhesins 
can be present on the bacterial surface as components of 
organelles such as fimbriae, also called pili or fibrillae 
these three terms being used interchangeably herein, or as 
non-fimbrial or afimbrial adhesins (ref. 64). Examples of 
fn.mbrial or pili adhesins include the following surface 
structures in E. coli: P pili, type x fimbriae , s pili# K88 
Pili, K99 pili, CS3 pili, Fi7 pili and CS31 A; in Klebsiella 
pneumoniae, type 3 pili; in Bordetella pertussis: type 2 
pili; in Yersinia enterocolitis: Myf fibrillae; in Yersinia 
pestis: pH6 antigen and Fl envelope antigen. 

Examples of non-fimbrial cell surface structures which have 
adhesin function or which may comprise proteins having such a 
functxon include capsules, lipopolysaccharide layers, outer 
membrane proteins, NFA (non-fimbrial adhesin)-!, NFA-2, NFA- 
3, NFA- 4 , AFA (afimbrial adhesins) -I, AFA-II and AFA-III. 
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In the present context, the term -fimbriae" designates long 
thread-like bacterial surface organelles. Fimbriae are hete- 
ropolymers each consisting of about 1000 structural compo- 
nents, mostly of a single protein species. However, in many 
cases a few percent minor components are also present. Adhes- 
ins can either be identical to the major structural protein 
as in Escherichia coli K88 and CFAl fimbriae and type 4 
fimbriae of Pseudomonas, Vibrio and Neisseria, or they may be 
present as minor components as in E. coli type l and P 
fimbriae [for reviews see Krogfelt 1991 (ref. 62) ; Kaufman 
and Taylor, 1994 (ref. 60): Kuehn et al., 1994 (ref. 63); 
Klemm and Krogfelt, 1994 (ref. 61)]. m the latter case, i.e. 
when present as minor compounds, the adhesins are closely 
related in amino acid sequence to the major fimbrial compo- 
nent. As used herein the term bacterial adhesin will also 
include adhesins isolated from non-bacterial sources includ- 
ing viruses, and which are expressed in a bacterium. 

in the following, the FimH adhesin of type 1 fimbriae will be 
described structurally and functionally as a representative 
example of a bacterial adhesin. 

FimH is located at the tip of the type 1 fimbriae and also 
intercalated at intervals in the fimbrial organelle. Most 
forms of the FimH adhesin target to (bind to) oligosaccharide 
structures containing terminally located a-D-mannoside resi- 
dues [Krogfelt et al., 1990 (ref. 29)]. Based on studies with 
various D-mannose derivatives the receptor binding site of 
the FimH adhesin is assumed to be shaped like an elongated 
pocket large enough to accommodate a trisaccharide motif 
[Sharon, 1987 (ref. 65)]. 

The fimH gene encodes the precursor FimH protein of 300 amino 
acids [Klemm and Christiansen, 1987 (ref. 27)]. This precur- 
sor is processed into a mature form of 279 amino acids. The 
amino acid sequence of the E. coli PC31 FimH protein is shown 
m Table 1 below wherein cysteine residues are indicated by 
astenxes, the signal peptide is outlined in bold letters. 
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and two regions contributing to the binding site are under- 
lined (SEQ ID NO:i). (it should be noted that residue 176 is 
a proline residue and not as previously indicated when the 
PC31 FimH protein was first published, an arginine residue) : 



Table 1. Amino arid sequence of t-.h« E. cali PC31 FimH protein 
-21 ! * 

MKRVITLPAVIiLMGWSVNAWS FACKTANGTAI PIGGGS ANVYVNLAPWNVG ONLWDLS 

* 

TQIFCTNDYPETITOYVTT.ORGgAY^^ 

DKPWPVALYLTPVSSAGG^KAGSLIAVLILRQTNNYNSDDFQFVWNIYANNDVVVPTG 
* * 

GCDVS ARDVTVTLPD YPGS VP I PLTVYCAKSQNLGYYLSGTHADAGNS I FTNTAS FS PAQ 



GVGVQLTRNGTIIPANNTVSLGAVGTSAVSLGLTANYARTGGQVTAGNVQSI 



279 
IGVTFVYQ 



The FimH contains 4 cysteine residues assumed to direct 
folding of the molecule into distinct functional domains. For 
comparison FimA and the minor components FimF and FimG only 
have two cysteine residues. The localization of the cysteine 
residues in FimH points to a tandem arrangement of two ances- 
tral genes. Furthermore, similar amino acids can be found in 
similar positions in the two halves of the FimH protein. The 
"midway" point is located roughly around residue 150 in the 
mature protein. The two halves or domains of FimH have 
evolved differently with the N- terminal section becoming the 
domain harbouring the receptor binding site, whereas the C- 
terminal sector became the domain of the molecule required 
for integration into the fimbrial organelle structure, i.e. 
having the features of a structural component. 

In- frame linker insertions into the fixnH gene confirms this 
model of the FimH protein. Thus insertions in the C-terminal 
half of the molecule generally do not interfere with the 



BNSDOCID: <WO__J9620657A1J_J» 



10 



15 



20 



25 



30 



WO 95/20657 £^ PCT/DK95/00042 

receptor- binding ability whereas abolishment of receptor 
binding ability following linker insertion in the N- terminal 
is the rule (Klemm et al . , unpublished data). A similar 
domain structure has been observed in the PapG adhesin of P- 
fimbriae [Hultgren et al., 1989 (ref. 59); Kuehn et al., 1994 
(ref . 63)] . 

In accordance with the invention, the recombinant bacterial 
adhesin as defined above is one which is derived from an 
adhesin having certain binding properties, but which recombi- 
nant bacterial adhesin has altered binding properties rela- 
tive to the naturally occurring adhesin (the parent adhesin) 
from which it is derived. As used herein this feature 
encompasses situations where the adhesin variant recognizes 
and binds to receptor moieties not being recognized by the 
parent adhesin irrespective of whether the adhesin variant 
has lost its normal ability to recognize and bind to a cer- 
tain receptor moiety or certain receptor moieties, or not. 

As used herein the term "binding" indicates that the adhesin 
has a degree of affinity to the receptor moiety which enables 
it, when brought into contact herewith, to interact in a 
binding manner with this moiety whereby an adhesin- receptor 
moiety association occurs. The strength of this binding 
depends on the type of binding force which causes the inter- 
action between the receptor moiety and the adhesin. In the 
present context, such binding forces include covalent binding 
and binding by non- covalent binding forces including hydrogen 
bonds, hydrophobic interactions, van der Waal forces and 
ionic interactions. Accordingly, the term "receptor moiety" 
as used herein encompasses any moiety to which an adhesin may 
interact by the above binding forces. 



In one specific embodiment, the adhesin variant is a FimH 
mannose- sensitive adhesin normally binding to a receptor 
selected from a domain where mannosyl residues are not ter- 
minal and a domain devoid of saccharide and having an amino 
35 acid sequence which differs from the E. coli PC31 FimH adhes 



BNBOOCID: <W0_9620e«7A1JU» 



WO 95/20657 A ^fc PCT/DK95/00042 

in by at least one amino acid residue substitution, including 
an amino acid sequence differing by at least 2 amino acids, 
preferably by at least 3 amino acids, more preferably by at 
least 4 amino acids, most preferably by at least 5 amino 
5 acids. In further useful embodiments, the amino acid sequence 
may even differ by more than 5 amino acids such as at least 
6, preferably by at least 7, more preferably by at least 8, 
even more preferably by at least 9 and in particular by at' 
least 10 amino acid residues, such as by at least 12 amino 
10 acids including by at least 15. 

Accordingly, the above FimH adhesin variant is preferably at 
least 90% homologous to the PC31 FimH adhesin, such as at 
least 92% homologous, more preferably at least 93% 
homologous, even more preferably at least 94% homologous, 
most preferably at least 95% homologous, and in particular at 
least 96% homologous, e.g. at least 97% homologous. In par- 
ticularly interesting embodiments, the adhesin is at least 
98% homologous, including at least 99% homologous such as at 
least 99.5% homologous. 
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The above FimH adhesin variant can be a chimeric adhesin 
comprising amino acid sequences from different FimH adhesins 
having identical or different binding specificities. 

As it has been mentioned above, the present invention is 
generally aimed at providing the means to design bacterial 
adhesins having specific binding properties whereby bacteria 
expressing the adhesin variants or the adhesin variants in 
isolated or purified form can be designed to bind to a speci- 
fic desired target receptor moiety. Accordingly, the adhesin 
variant may in accordance with the invention be an adhesin 
variant as defined above which binds to an animate receptor 
moiety. Such receptors include receptors located on inner 
surfaces of humans and animals, such as e.g. the mucosal 
membranes of the gastrointestinal tract including the teeth 
and the oral cavity, and the mucosal membranes of the respir- 
atory and the genitourinary systems. Included are also 
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adhesin variants that bind to outer surfaces, including the 
skin, of humans and animals. 

In a further embodiment, the adhesin variant is designed so 
as to acquire the ability to bind to a plant receptor moiety. 
5 This aspect is of particular interest in relation to deliber- 
ate release to out -door or in-door environments where plants 
are cultivated, of useful recombinant bacteria having a 
desirable effect on the growth and yield of the plants. 
Such desirable bacteria are e.g. bacteria expressing a pesti- 
10 cidally active substance, i.e. a biopesticide including as 

examples pesticidal toxins produced naturally by Bacillus spp 
such as the Bacillus thuringzensis (Bt) toxin. In this con- 
text, another example is bacteria which protect plants 
against low temperature damages or bacteria which express 
gene products protecting plants against detrimental effects 
of herbicides. 
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By providing such bacteria with genes expressing adhesin 
variants which e.g. bind specifically to certain plant spe- 
cies and/or to certain locations on the plant, these useful 
bacteria will, when administered to the plant growing envi- 
ronment, be selectively associated with the target plant 
species or a specific target area of the plant. It may thus 
be desirable to have these useful bacteria administered to 
the leaves of the plants or to have the root system colonized 
25 herewith. 

Accordingly, the present invention encompasses adhesin vari- 
ants as defined herein which bind selectively or specifically 
to a phylloplane receptor moiety or which bind to receptors 
on plant roots. Similarly, adhesin variants can be provided 
which are targeted to the stem or the flowers of the plants. 

As it is mentioned above, bacterial adhesins include adhesins 
having an inherent capability to bind or interact with inani- 
mate surfaces carrying receptor moieties with which the 
adhesin can interact to become bound to the surfaces. It is 



30 
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known that certain bacterial adhesins can bind to inanimate 
surfaces including as examples glass, hydroxyapatite (a tooth 
enamel model compound) or polymer structures including plas- 
tics and polysilicates . The present invention has made it 
possible to design bacteria which bind selectively to any 
inanimate surface which carries a receptor moiety for which 
an adhesin variant binding thereto may be constructed. Accor- 
dingly, the present invention also provides an adhesin vari- 
ant as defined herein which binds to an inanimate receptor 
moiety. Such adhesin variants are particularly interesting in 
connection with the concept of bioremediation, i.e. a tech- 
nology designed to enhance degradation of chemical pollutants 
in the environment. It is clearly a significant improvement 
of this technology to have at hand bacteria which comprise 
15 genes coding for pollutant -degrading gene products and which 
also express adhesins targeting the bacteria selectively to 
the environment where the pollutants are present, e.g soil, 
aquatic environments and drinking water supply systems. 
Furthermore, adhesin variants capable of binding to tooth 
enamel are useful in the protection of teeth against caries. 



10 
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In a further embodiment, there is in accordance with the 
invention provided an adhesin variant which is part of a 
fusion protein comprising the adhesin variant and a non- 
adhesin, heterologous polypeptide. Using the FimH as an 
25 example, it has been found that fusions between a bacterial 
adhesin and other proteins can be made whereby the resulting 
fusion proteins are inserted into the cell surface organelle 
of which the adhesin is a structural part. These resulting 
hybrid adhesin- carrying cell organelles remain fully func- 
tional with respect to binding properties. Additionally, it 
has been found that large regions of non- adhesin proteins, 
e.g. regions comprising in the range of 1 to 100 amino acids 
including a range of 5 to 75 amino acids and a range of 10 to 
60 amino acids, such as regions comprising 15 to 54 amino 
acids, can be inserted into type l fimbriae without impairing 
the binding properties of the fimbriae. 
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In useful embodiments of the invention, the non-adhesin 
region of a fusion protein comprising an adhesin variant as 
defined herein include a heterologous polypeptide which is 
selected from an epitope, an enzyme, a toxic gene product and 
an antibody. 

It has significantly been found that, when fusion proteins 
are expressed in which the heterologous polypeptide is an • 
epitope or an epitope- carrying domain forming an integrated 
part of the fusion protein, and thus presented on the surface 
of the host cell expressing the fusion protein, the epitope- 
carrying polypeptide can be presented in a conformational 
form similar to its natural conformation. 

Furthermore, it has surprisingly been found that the above 
fusion proteins can be overproduced by the bacteria compris- 
ing hybrid genes coding for fusion proteins, resulting in 
excretion of the fusion proteins to the growth medium in 
large quantities. Accordingly, the excreted fusion proteins 
are then readily isolated and purified, e.g. by means of 
affinity chromatography. These findings provide the means to 
manufacture bacterial cells having on their surface hybrid 
adhesin- carrying cell organelles as well as to produce large 
quantities of excreted fusion proteins, both of which can be 
targeted to specific surfaces as determined by the binding 
properties of the adhesin variant of the fusion protein. 

The above technology of making adhesin variant -fusion pro- 
teins is useful for a range of industrially important pur- 
poses such as: 

(i) development of live vaccines targeted to specific cellu- 
lar surfaces; 

(ii) development of subunit vaccines for administration 
orally or by injection, which are targeted to pre -determined, 
specifically selected cell surfaces or mucosal surfaces ; 
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(iii) development of fusion proteins combining specific 
binding properties with specific enzymatic or toxin activ- 
ities. Such fusion proteins have applications as therapeuti- 
cal or diagnostical agents, including use in biosensors; 

(iv) use of fusion proteins as carriers of non-covalently 
linked chemical moieties whereby the adhesin part of the 
protein is used to target the chemical moiety to specific 
locations and the non-adhesin part carries and then releases 
the moiety when the fusion protein has reached its target. 
Examples of chemical entities which may be linked to the 
fusion protein include imaging agents and pharmacologically 
active components. Examples of applications for this use 
include imaging of atherosclerotic plaques or tumor tissues, 
and delivery of chemical agents at specific locations in or 
on microbial, human, animal or plant cells including specific 
tissues or tissue components; 

(v) development of fusion proteins which are useful in affin- 
ity purification processes. 

It has been found that the fimH gene coding for the E. coli 
FimH adhesin is not a single gene but rather a family of fimH 
genes, and accordingly it has now been established that 
allelic variants of E. coli fimH genes exist that encode 
adhesin proteins which, relative to the known E. coli PC31 
fimH gene product differ by as little as a single amino acid 
substitution and confer distinct binding or adhesive 
phenotypes . 

Accordingly, as it has been mentioned above, the present 
invention relates in a further aspect to a FimH adhesin 
having an amino acid sequence which differs from the E. coli 
PC31 FimH adhesin as defined above by substitution of at 
least one amino acid, it will be understood that such an 
adhesin encompasses naturally occurring adhesins as well as 
adhesins which are encoded by recombinant or mutant fimH 
genes. In this context the term "fimH gene- denotes a gene 
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coding for a gene product which can be integrated into a type 
1 fimbria and which confers to the fimbria the ability to 
recognize and bind to a receptor. 

The FimH adhesin as defined above may be an adhesin having 
5 its inherent binding properties or an adhesin variant which 
in relation to an adhesin encoded by a naturally occurring 
gene from which the gene coding for the adhesin variant is 
derived, has altered binding properties. Furthermore, the 
FimH adhesin may be either mannose- sensitive or mannose- 
10 insensitive. The term "mannose -sensitive" is used herein to 
designate that the binding of an adhesin is inhibited in the 
presence of mannose residues. In one specific embodiment, the 
FimH adhesin may be a FimH adhesin normally binding to a 
receptor moiety selected from a domain where mannosyl resi- 
dues are not terminal and a domain devoid of saccharide such 
as e.g. a glycol ipid, a glycoprotein, a protein, a 
polypeptide and a peptide, including a hormone. Examples of 
proteins to which a FimH adhesin according to the present 
invention may bind include as examples animal proteins such 
as a casein including K-casein, a gelatine, a globin, an 
albumen and a collagen, and vegetable proteins including soy 
protein. 
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The FimH adhesin according to the invention include an adhes- 
in having an amino acid sequence which differs from the E 
coll PC31 FimH adhesin by at least 2 amino acid residues, 
such as an amino acid sequence differing by at least 3 amino 
acids, preferably by at least 4 amino acids, more preferably 
by at least 5 amino acids, most preferably by at least 6 
amino acids. In further useful embodiments, the amino acid 
sequence may even differ by more than 6 amino acids such as 
at least 7, preferably by at least 8, more preferably by at 
least 9, even more preferably by at least 10 and in particu- 
lar by at least li amino acid residues, such as by at least 
12 amino acids including by at least 15. 
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Accordingly, the above FimH adhesin is preferably at least 
90% homologous to the PC31 FimH adhesin, such as at least 92% 
homologous, more preferably at least 93% homologous, even 
more preferably at least 94% homologous, most preferably at 
least 95% homologous, and in particular at least 96% 
homologous, e.g. at least 97% homologous. In particularly 
interesting embodiments, the adhesin is at least 98% 
homologous, including at least 99% homologous or at least 
99.5% homologous . 



10 In one specific embodiment, the FimH adhesin as defined above 
is one which, when tested for binding to yeast mannan (Ma) , 
human plasma fibronectin (Fn) , periodate treated Fn and the 
synthetic peptide FnSpl comprising the first 30 amino acids 
of Fn, only binds to Mn. In the following, an adhesin having 

15 this pattern of binding properties is designated an 

M class FimH adhesin. In other specific embodiments, the FimH 
adhesin is an adhesin which, when tested for binding to the 
above compounds, binds to Mn and Fn (MF class FimH adhesin) 
or an adhesin which among these compounds bind to all of 

20 these (MFP class FimH adhesin) . 

It has been found that bacteria expressing FimH adhesins of 
the above MFP class bind in a mannose- sensitive (MS) manner 
to polyoxyethylene sorbitan monolaurate (Tween 20) and a 
little less well to polyoxyethylene sorbitan monooleate 
25 (Tween 80) . Furthermore, bacteria expressing MFP class FimH 
adhesins make a much tougher pellicle than bacteria expres- 
sing other types of adhesins. In the present context, the 
term "pellicle" indicates a layer or film of bacteria that 
forms at the air/liquid interface of a liquid growth medium. 
This noticeable phenomenon might be of particular interest 
where there is a reason to concentrate microorganisms at the 
surface of an aquatic environment, such as e.g. bacterial 
cells which in accordance with the present invention express 
a pollutant -degrading gene product. 
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Another interesting finding is that bacteria expressing a MFP 
class adhesins bind to hydroxyapatite to a higher degree than 
do bacteria expressing a M class adhesin. Hydroxyapatite, 
especially saliva- treated hydroxyapatite is i.a. used as a 
model for tooth enamel, and accordingly, this finding indi- 
cates that bacteria expressing MFP class adhesins are par- 
ticularly useful in bacterial compositions intended for 
colonization of teeth. 



It has also been found that the MFP class adhesins bind to a 
large range of synthetic peptides and accordingly seem to 
have a broad specificity in terms of amino acid motifs. 



In further specific embodiments of the invention, the FimH 
adhesin is an adhesin which, when tested for binding to the 
five Fn- fragments obtained by thermolysin treatment as it is 
15 described in reference No. 51, only binds to the 40-kDa 

gelatin- binding fragment or which binds to all of these Fn- 
fragments, or to none of these. 

In addition to the above classes of FimH adhesins, another 
class has been identified which is designated the M L (low 

20 adhesive) class. Such an adhesin confers the ability to 

aggregate yeast cells in a mannose- sensitive (MS) fashion, in 
titers similar to M class adhesins, but surprisingly, it 
binds at only low levels to Mn or Fn and FnSpl. Furthermore, 
adhesins of this low adhesive M L class adhere poorly to MDCK, 

25 buccal cells and erythrocytes as compared with M class adhes- 
ins. Example of a M L class adhesin is one expressed by the 
recombinant E. coli strain KB 23 which differs only from the 
PC31 FimH adhesin by having an alanine instead of a valine at 
residue 27 and the FimH adhesin expressed by the human fecal 

30 E. coli isolate which is designated F-18 [McCormick et al., 
1989 (ref. 34)]. This latter adhesin differs from the PC31 
FimH in three amino acid residues and the F-18 isolate has 
been found to colonize the large intestine to a higher degree 
than certain E . coli K-12 strains do. Accordingly, it is 

35 contemplated that these M L class adhesins confer to 
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gastrointestinal bacteria the ability to colonize the large 
intestine which is significant for a live bacterial vaccine 
for exerting its immunological effect in the gastrointestinal 
tract . 



10 



Furthermore, it has been found that among M class adhesins 
adhesion is found that is not sensitive to inhibition by D- 
mannose. Such a mannose- insensitive (or mannose- resistant) M 
class adhesin is designated in the following as an M R adhes- 
in. One example of a bacterial strain expressing an M R adhes- 
in is the clinical isolate U221-3 which is mentioned in the 
following . 
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In accordance with the invention, a FimH adhesin as defined 
above can be a chimeric adhesin comprising amino acid 
sequences from different FimH adhesins. Such chimeras are 
constructed e.g. by providing multiple restriction fragments 
of a fimH gene, followed by exchanging under ligation condi- 
tions these fragments with corresponding fragments of an 
other fimH gene and cloning the ligation product as it is 
described in Example 1 below. As it is also explained below, 
recombinant plasmids containing such chimeric fimH genes can 
be transformed into a host cell and transf ormants tested for 
adhesive phenotype, allowing determination of the regions of 
each gene capable of conferring functional activity (Fig. 5) . 
These studies which are described in details below showed 
that all of the sequence changes relative to the PC31 fimH 
gene that affected binding function in the studied strains of 
E. coli CSH-50 and clinical isolates (CIs) designated #s 3, 
4, 7, io, F-18 and U221-3, respectively, occurred between 
residues 27 and 119, both included, of the 279 residue, 
3 0 mature fimH sequences. 

Accordingly, the invention encompasses in one embodiment a 
FimH adhesin comprising an amino acid sequence which differs 
from the E. coli PC31 FimH adhesin by at least one amino acid 
occurring between residues 27 and 119 of the mature FimH 
35 sequence, including a FimH adhesin comprising an amino acid 
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sequence which differs from the E. coli PC31 FimH adhesin by 
at least one amino acid occurring between residues 33 and 78 
of the mature FimH sequence. 

The selected potential receptors for a FimH adhesin as 
defined above include those animate and inanimate receptors 
mentioned above for a recombinant bacterial adhesin variant 
and the potential uses of the FimH adhesins are also the same 
as those uses described above for this recombinant bacterial 
adhesin variant. 



As mentioned above, the invention relates in a further aspect 
to a recombinant replicon comprising a DNA sequence coding 
for a recombinant bacterial adhesin variant as defined herein 
or a DNA sequence coding for a FimH adhesin as also defined 
herein. Such a replicon is suitably selected from a chromo- 
some or a plasmid. The DNA sequence includes a sequence which 
is inserted by conventional recombination techniques such as 
insertion by means of restriction enzymes and subsequent 
ligation, or the DNA sequence is provided by subjecting a 
replicon comprising a naturally occurring sequence coding for 
an adhesin to a mutagenization procedure including site- 
directed mutagenesis, insertion of a transposable element, 
mutagenization by radiation or chemical mutagenization, 
followed by selection of cells comprising a mutated sequence 
conferring altered binding properties relative to a cell 
25 comprising the wild- type sequence. 



30 



In preferred embodiments, the recombinant replicon is one 
having a broad host range including bacterial species na- 
turally occurring in soil, in aquatic environments, on inner 
and outer surfaces of humans and animals, and which is com- 
patible with replicons occurring in potential host strains. 

In one useful embodiment, the recombinant replicon as defined 
above is one wherein the DNA sequence codes for a FimH adhes- 
in having an amino acid sequence which differs from the E. 
coll PC31 FimH adhesin by at least one amino acid, including 
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an adhesin having an amino acid sequence which differs from 
the E. coli PC31 FimH adhesin by at least 2 amino acid resi- 
dues, such as an amino acid sequence differing by at least 3 
amino acids, preferably by at least 4 amino acids, more 
preferably by at least 5 amino acids, most preferably by at 
least 6 amino acids. In further useful embodiments, the amino 
acid sequence may even differ by more than 6 amino acids such 
at least 7, preferably by at least 8, more preferably by at 
least 9, even more preferably by at least 10 and in particu- 
lar by at least 11 amino acid residues, such as by at least 
12 amino acids including by at least 15. 

Accordingly, the above recombinant replicon preferably com- 
prises a DNA sequence coding for a FimH adhesin which is at 
least 90% homologous to the PC31 flmH gene, such as at least 
15 92% homologous, more preferably at least 93% homologous, even 
more preferably at least 94% homologous, most preferably at 
least 95% homologous, and in particular at least 96% 
homologous, e.g. at least 97% homologous. In particularly 
interesting embodiments, the adhesin is at least 98% 
homologous, including at least 99% homologous such as at 
least 99.5% homologous. 

In a further embodiment, the above replicon comprises a DNA 
sequence which is a chimeric flmH gene as it has been defined 
above, comprising DNA from different fimH genes. The replicon 
can also be one which comprises a further DNA sequence e.g. 
derived from a microorganism selected from a bacterium, a 
virus, a protozoan, a fungus and a yeast. This further DNA 
sequence is e.g. one coding for a heterologous polypeptide, 
including an epitope, an antibody, a toxic gene product, an 
enzyme, a pesticidally active gene product and a pollutant - 
degrading gene product. 

In useful embodiments, the replicon as defined herein com- 
prises a DNA sequence which is isolated from an Enterobacte- 
rlaceae species, including a DNA sequence which is isolated 
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from E. coli, a Klebsiella sp. , an Enterobacter sp. , a 
rex-sirzia sp. or a Salmonella sp. 

in addition to being a DNA sequence as defined above, the 
sequence can be a synthetic sequence constructed by conven- 
5 tional techniques of DNA synthesis. 

As it is also mentioned above, the present invention 
encompasses a fusion protein comprising a recombinant bacte- 
rial adhesin variant or a FimH adhesin as defined above, and 
a heterologous polypeptide. Such a polypeptide is in useful 
embodiments an immunologically active gene product i.e. an 
epitope (antigenic determinant) from a pathogenic organism, 
which polypeptide, when administered to the body of a human 
or an animal is capable of stimulating the formation of 
antibodies therein. A cell in which such an epitope is 
expressed is advantageously utilized in the preparation of 
live vaccines. Such vaccines have several advantages over 
known live vaccines: 

Firstly, the epitope forms a structural part of an adhesin 
which is embedded in a surface organelle of the vaccine 
20 cells. This implies that the hybrid DNA sequence coding for 
the epitope further comprises the means for transporting the 
epitope, when expressed, to the outer surface of the cell, 
i.e. translocating it through the cell membrane. This is 
immunologically highly advantageous, since the epitope will 
be brought more closely in contact with immunologically 
competent cells of the body to which the fusion protein- 
expressing vaccine cells are administered. 

Secondly, the adhesin part of the epitope- carrying fusion 
protein can be selected so as to have specific binding prop- 
erties whereby the vaccine cell may be targeted to a particu- 
lar location in the body where an immunological response to 
the epitope is desirable. The adhesion of the epitope- carry- 
ing cell to a particular location or region of the body will 
in this manner ensure that the cell is retained in the human 
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or animal body in that particular location for a period of 

time which is sufficient to obtain the desired immune 
response . 



10 



In accordance with the invention, a useful cell for expres- 
sion of the above fusion protein is one selected from a 
bacterial species which inherently contains an adhesin- carry- 
ing surface organelle. Such species include as examples gram- 
negative species of Enterobacteriaceae such as E. coli, 
Klebsiella spp, Salmonella spp, Yersinia spp, Vibriona'ceae , 
Hemophilus spp, Bordetella spp and Pseudomonadaceae , and 
gram-positive species such as Neisseria spp and Streptococcus 
spp. 

The epitope part of a fusion protein according to the inven- 
tion can be an epitope derived from any pathogenic organism 
15 or agent against which it is desirable to develop vaccines. 
Such pathogenic organisms include viruses, bacteria and 
eucaryotic organisms such as fungi, yeast or protozoa. 

Whereas cells expressing an epitope- carrying fusion protein 
as defined herein may be used as a live vaccine, it is also 
within the scope of the invention to provide isolated and/or 
purified cell surface organelles comprising the fusion pro- 
tein, including fimbriae and pili, as a vaccine, and it is 
also contemplated that useful vaccines may be provided where- 
in cells expressing an epitope- carrying fusion protein have 
been killed by conventional methods such as formaldehyde 
treatment or thermal treatment. 

In a further embodiment of the invention, the fusion protein 
according to the invention comprises as the non-adhesin 
polypeptide part a toxic gene product e.g. having a selective 
toxic effect on particular cells in the body such as e.g. 
cancer cells. By selecting the adhesin part as one having a 
specific binding affinity to receptors in such cells it is 
possible to have cells expressing the toxic gene product 
bound selectively to such target cells whereby these cells 
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may be killed or damaged by the toxic gene product. It is 
also possible to use isolated or purified cell organelles 
containing a fusion protein comprising the cell toxic 
(cytotoxic) gene product for the purpose of targeting the 
! toxic product. 

In a further interesting embodiment, the fusion protein 
comprises an antibody. Such an embodiment is, Inter alia, 
particularly interesting with respect to the provision of 
fusion proteins which may be used in affinity purification of 
biological compounds having binding affinity to the antibody 
part of the fusion protein. It is contemplated that cells 
expressing as part of a surface organelle, such a fusion 
protein may be utilized directly as a means of concentrating 
a biological compound, or the isolated surface organelles 
comprising the antibody- carrying fusion protein may be used 
for this purpose. 
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Furthermore, the fusion proteins as defined herein are useful 
as carriers of non- covalently bound compounds such as pharma- 
cologically active, diagnostically active and imaging com- 
pounds with the purpose of providing cells or cell organelles 
carrying the active compounds, which thereby become targetab- 
le to particular regions or locations of a body to which 
these cells or cell organelles are administered. The inven- 
tion encompasses any combination of a fusion protein as 
defined herein and an active compound which can be covalently 
bound to a fusion protein. 

As mentioned above, the present invention encompasses in one 
aspect a recombinant bacterial cell which expresses a recom- 
binant bacterial adhesin variant or a FimH adhesin as defined 
above. In one specific embodiment, the bacterial cell is one 
which comprises the above-defined recombinant replicon 
Depending on the field of application of such a cell, it may 
e.g. be selected from a soil bacterium, an aquatic bacterium, 
a bacterium which is normally associated with plants, a 
bacterium which is member of the human or animal indigenous 
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bacterial flora, or a bacterium which is adapted to colonize 
certain ecological niches such as e.g. sewage purification 
plants or certain inanimate surfaces. 

The major significant advantages which have been achieved by 
the present invention is the possibility to provide recombi- 
nant bacterial cells which are not only ecologically well- 
adapted to grow in a particular ecological environment, but 
which are also provided with means for colonizing more perma- 
nently in their ecologically natural environment . These means 
for improved ability to colonize an environment are the 
adhesins expressed by the bacteria which have been con- 
structed and/or selected so as to enable the recombinant 
bacterial cell to adhere to or bind to specific receptors in 
the environment, i.e. the bacterial cells are targeted to 
that environment. Thereby the bacteria according to the 
present invention will have an ecologically competitive 
advantage relative to organisms in the particular environment 
which do not have surface structures comprising adhesins 
binding to receptors present in the environment, at least not 
to the same extent as the bacterial cells according to the 
invention. 

in addition to the environment -specific adhesins which the 
bacterial cell expresses, the cell will have a phenotype 
which is desirable in the environment to which it is tar- 
geted. As one example, a cell according to the invention 
which is originally isolated from the human or animal indige- 
nous bacterial flora may typically be one which expresses an 
epxtope including an epitope which is part of a fusion pro- 
tein expressed by the bacterial cell . As another example may 
be mentioned a bacterial cell which is isolated from a plant 
and which expresses a pesticidally active compound such as a 
Bacillus thurincjiensls toxin. Further examples include a 
plant root -associated nitrogen- fixating bacterium isolated 
from soil which in accordance with the invention is provided 
with adhesins improving the capability of the bacterium to 
become permanently colonized to the roots of a specific plant 
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or specific plants, or a bacterium which is ecologically 
associated with an aquatic or terrestrial environment con- 
taining pollutants to be degraded or removed. 

Accordingly, the recombinant bacterial cell can be derived 
5 from any gram- negative or gram-positive bacterium for which a 
need exists to obtain improved colonization in a particular 
inanimate or animate environment. Such bacteria include as 
examples finterobactex-iaceae spp, Hemophilus spp, Neisseria 
spp, Bordetella spp, Streptococcus spp, Pseudomonadaceae spp, 
10 Vibrionaceae spp, Baccilaceae spp. 

In certain embodiments of the invention it is advantageous 
that the present recombinant bacterial cell is provided as 
one which, when it is administered to a particular location 
or environment, will not persist in that environment. Accord- 
ingly, such a recombinant bacterial cell may further comprise 
a gene coding for a gene product which, when expressed has a 
killing or cell function-limiting effect in said cell, the 
expression of said gene coding for the cell killing or cell 
function- limiting gene product being regulated in such a 
manner that the bacterial cell when targeted to receptor in a 
specific location will be killed or limited in its function 
in a pre-determined manner. The gene coding for the cell 
killing or cell function- limiting gene product is suitably 
regulated by a factor selected, from the group consisting of a 
stochastic event, the presence/absence of a chemical compound 
m the location, and a physical factor. 

In a further aspect, the invention relates to a method of 
isolating or constructing a recombinant bacterial cell ex- 
pressing an adhesin having modified binding properties rela- 
tive to a natively expressed adhesin such as a natively 
expressed FimH adhesin. As it is defined above, this method 
comprises identifying in the bacterial cell DNA sequence (s) 
coding for the binding domain (s) of said natively expressed 
adhesin and substituting at least one codon herein whereby a 
modified adhesin molecule is expressed that is different in 
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at least one amino acid from the adhesin expressed natively, 
and selecting a bacterial cell expressing the modified adhes- 
in having an altered adhesion phenotype relative to the 
natively expressed bacterial adhesin. 

5 As it is explained in details below, the binding domain can 
e.g. be identified by constructing chimeric adhes in -encoding 
genes and screening for cells which by having a region in the 
adhesin gene replaced by a corresponding heterologous region 
of a different DNA sequence, acquires a new binding 
10 phenotype. Having identified a binding domain of the natively 
expressed adhesin, recombinant cells having desirable binding 
phenotypes may be obtained by substituting one or more codons 
in the binding domain (s) to obtain expression of recombinant 
adhesins and selecting cells having the desirable phenotypes. 
The substitution of codons may be achieved by methods know 
per se such as site -directed mutagenesis using synthetic 
oligonucleotides and PCR technology or transposable elements 
or by conventional radiation or chemical mutagenization. 
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In certain useful embodiments, the above method includes 
steps whereby a non- adhesin compound is associated with the 
adhesin, e.g. a step where a gene coding for the recombinant 
adhesin is part of a hybrid gene comprising a gene coding for 
a non- adhesin polypeptide which thereby is expressed with the 
recombinant adhesin as part of a fusion protein comprising 
the adhesin. Furthermore, recombinant adhesins resulting from 
the above method may in specific embodiments comprise a non- 
covalently bound compound which is associated with the adhes- 
in when expressed. 

As mentioned above, the invention also encompasses recombi- 
nant bacterial cells having selected binding properties 
whereby cells with desirable phenotypes can colonize environ- 
ments where the presence of bacteria having a particular 
phenotype is advantageous. Accordingly, there is in a further 
aspect of the invention provided a method of preparing a 
recombinant bacterial cell that binds to a specific receptor 
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moiety, comprising introducing into a bacterium that does not 
produce an adhesin binding to said receptor moiety, a DNA 
sequence coding for an adhesin binding to the receptor moie- 
ty, and selecting a bacterial cell expressing the DNA 
5 sequence . 
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The primary objective of this method is to provide the means 
of constructing a bacterial strain having the capacity to 
colonize an environment, based on a parent strain which has 
an inherent, useful phenotype in this particular environment 
but which does not express an adhesin binding to receptor 
moieties in the environment. Accordingly, the method includes 
as a first step the isolation of an environmentally adapted 
bacterium not binding to appropriate receptor moieties and in 
subsequent steps, the identification of heterologous genes 
encoding adhesins which bind to receptor moieties occurring 
in said environment, preferably moieties occurring abundant- 
ly, isolating this gene and introducing it into the above 
parent strain. The adhesin gene may e.g. be a gene coding for 
a naturally occurring FimH adhesin or a recombinant FimH 
2 0 adhesin as defined above. 

In one useful embodiment of the method, the adhesin -encoding 
gene is introduced by transforming a parent bacterial cell 
with a recombinant replicon as defined herein. In further 
embodiments, the method is designed so as to obtain a cell 
wherein a non- adhesin compound is associated with the adhes- 
in, e.g. by introducing the gene coding for an adhesin as a 
hybrid gene coding for a non-adhesin polypeptide whereby non- 
adhesin compound is expressed with the adhesin as part of a 
fusion protein comprising the adhesin, or by binding non- 
covalently a compound to the adhesin when expressed. 

Besides the above method, an adhesin carrying bacterial cell 
having an altered pattern of adhesion can be provided by 
using a selection procedure comprising contacting an appro- 
priately sized population of wild- type adhesin- carrying 
bacterial cells with a potential receptor moiety to which the 
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wild- type cells do not adhere, e.g. in a manner as it is 
disclosed in Example 6 below whereby spontaneously or random- 
ly mutated cells having acquired the ability to adhere to the 
receptor moiety in question, become progressively enriched 
» From such an enriched culture, cells with the new adhesion 
ability can readily be isolated and further characterized. 

As it has been explained in details above, one primary objec- 
tive of the present invention is to provide the means of 
targeting a compound to a specific location. Accordingly, the 
invention relates in an important aspect to a method of 
targeting an adhesin to such a location. The method comprises 
the identification in the location of a receptor moiety, said 
moiety preferably being one which occurs abundantly in the 
particular location, which moiety can recognize and interact 
with an adhesin, and the isolation of a bacterial cell which 
xs capable of growing in the location and expressing an 
adhesin which recognizes and interacts with the identified 
receptor moiety, and administering the cell or the adhesin in 
an isolated form to that location. 

The identification of a suitable receptor moiety in a par- 
ticular location can be carried out in several manners. One 
example is a screening procedure where cells expressing known 
adhesins or known isolated adhesins are administered to the 
location e.g. being isolated cells or tissues of microbial 
animal or plant origin or an inanimate surface as defined 
herein, and screening for binding/adhesion of the tested 
adhesins e.g. according to adhesion assays as disclosed 
herein. If binding of one or more adhesins occurs, it is an 
indication that receptor moieties for that or those tested 
adhesin(s), is/are present in the location. 

Alternatively, available data with regard to the presence and 
amounts of chemical moieties present on the surfaces of the 
location may be collected or such data have to be generated 
and based upon such data, adhesins which are known to bind to 
one or more of the identified major moieties are selected and 
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their binding to this/these structure (s) is tested e.g. 
according to the assays as used herein. Chemical moieties 
which are considered potential adhesin- interacting receptor 
moieties include as examples glycolipids, glycoproteins, 
i proteins, polypeptides, saccharide moieties and peptides. 

In the case no suitable chemical moiety is identified in the 
location, which is capable of binding to known adhesins or 
which bind with a sufficient affinity, it is required to 
construct a library of modified adhesin molecules based on 
known adhesins which are modified by replacing one or more 
codons as it is explained herein, and/or such a library 
provided by constructing synthetic adhesin molecules, and 
then screening this library for recognition of and interac- 
tion with identified location surface moieties. A library of 
modified FimH adhesins may e.g. be selected for specificity 
towards a given receptor by running clones of these adhesins 
through a column or matrix containing the receptor moiety in 
question or cells or tissues isolated from the location 
without knowing what the receptor moiety is. The clone (s) 
expressing the adhesins with affinity to receptor moie- 
ty/moieties will adhere/bind to the column or matrix, and can 
subsequently be isolated therefrom. 

It is within the contemplation of the invention that crystal - 
lographic analyses of adhesins, whether naturally occurring 
or constructed as indicated above, is a useful technique for 
the obtainment of information about adhesin structures that 
assumingly will recognize and interact with particular adhes- 
in receptor moieties. 

In accordance with the invention, one embodiment of the above 
method is one wherein the isolated bacterial cell expresses 
an adhesin having modified receptor moiety- binding properties 
relative to an adhesin natively expressed by the cell, the 
isolation of the cell comprising identifying in a parent 
bacterial cell, DNA sequence (s) coding for the binding 
domain(s) of said natively expressed adhesin and substituting 
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at least one codon herein, whereby a modified adhesin mole- 
cule is expressed that is different in at least one amino 
acid from the adhesin expressed natively, and selecting a 
bacterial cell expressing the modified adhesin having an 
altered adhesion phenotype relative to the natively expressed 
bacterial adhesin or a method wherein the bacterial cell 
expressing an adhesin that recognizes and binds to the 
receptor moiety is a recombinant bacterial cell derived from 
a parent bacterial cell that does not produce an adhesin 
binding to said receptor, by inserting into the parent cell a 
DNA sequence coding for an adhesin binding to the receptor 
moiety, and selecting a bacterial cell expressing the DNA 
sequence. 

One primary objective of the present invention is the target- 
ing of useful non-adhesin compounds to a particular location. 
Accordingly, the invention encompasses in an interesting 
embodiment a method as defined above wherein a non-adhesin 
compound is associated with the adhesin, whereby said com- 
pound is targeted with the adhesin to the location comprising 
the receptor moieties recognizable by the adhesin. 
The compound can be associated with the adhesin by a covalent 
binding or by any of the above mentioned non- covalent types 
of molecule interaction forces. 

When associated covalently with the adhesin the compound to 
25 be co- targeted to the selected location with the adhesin can 
be an enzyme, an antibody, an epitope or a toxin which is 
part of a fusion protein comprising the adhesin. A compound 
which is associated with the adhesin by a non-covalent bind- 
ing is typically a pharmacologically active, diagnostically 
30 active or imaging compound. 

Locations to which it is desirable to have an adhesin tar- 
geted by the present method include a human or animal sur- 
face, a plant surface and an inanimate surface as defined 
above . 
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In one specific embodiment of the present method the bacteri- 
al cell being administered to the location expresses a recom- 
binant bacterial adhesin variant derived from a naturally 
occurring parent adhesin, said adhesin variant having altered 
binding properties relative to the naturally occurring adhes- 
in from which it is derived, the altered binding properties 
including binding to at least one receptor moiety to which 
the parent adhesin does not bind. Such an adhesin variant 'is 
advantageously derived from a naturally occurring adhesin 
isolated from a cell structure selected from the group con- 
sisting of a capsule, a lipopolysaccharide layer, on outer 
membrane protein, a flagellum, a pilus, a fimbria, a non- 
f imbrial adhesin (NPA) or an af imbrial adhesin (AFA) . 

In specific embodiments of the invention, the above adhesin 
variant as used in the present method is a protein having an 
amino acid sequence differing in at least one amino acid 
residue from its parent protein adhesin such as a FimH adhes- 
in having an amino acid sequence which differs from the E. 
coll PC31 FimH adhesin as defined herein in at least one 
amino acid. Such a FimH adhesin includes an adhesin which 
binds to a receptor selected from the group consisting of a 
domain where mannosyl residues are not terminal and a domain 
devoid of saccharide and an adhesin variant which is at least 
90% homologous to the PC31 FimH adhesin as defined herein, 
such as at least 92% homologous, more preferably at least' 93% 
homologous, even more preferably at least 94% homologous 
most preferably at least 95% homologous, and in particular at 
least 96% homologous, e.g. at least 97% homologous, in par- 
ticularly interesting embodiments, the adhesin is at least 
98% homologous, including at least 99% homologous or at least 
99.5% homol ogous . 

The above FimH adhesin can be a chimeric adhesin as defined 
above, comprising amino acid sequences from different FimH 
adhesins and constructed according to the methods below. 
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In accordance with the invention, an adhesin can be adminis- 
tered to a location in the form of an adhesin- expressing 
bacterial cell. Such a cell is one capable of growing in that 
particular location. Accordingly, the bacterial cell is 
suitably derived from a bacterial species which is normally 
occurring in the location including human or animal body 
surfaces, plant surfaces such as plant root surfaces and 
inanimate surfaces. In this context, an animal body surface 
includes the insect gut, whereto it is desirable to adminis- 
ter a bacterial cell expressing an insecticidally active 
toxin. 



Thus, if it is desired to administer the bacterial cell to 
the root of a plant, a suitable bacterial cell is preferably 
isolated from a strain which has colonized the rhizosphere of 
15 that plant to a large degree, i.e. the strain is a major 
member of the natural plant root flora. Such an isolate is 
then provided with a gene coding for an adhesin which will 
recognize and interact with an abundantly occurring moiety on 
the roots of said plant. In this manner, a suitable adhesin 
which is expressed naturally in a bacterium which is not 
adapted to grow in a plant rhizosphere, becomes expressible 
in a normal inhabitant of the rhizosphere environment (loca- 



tion) . 



In specific embodiments of the present method of targeting a 
bacterial adhesin to a specific location, the adhesin is a 
FimH adhesin as defined above, having an amino acid sequence 
which differs from the E. coli PC31 FimH adhesin as defined 
herein in at least one amino acid. 



30 



35 



In an interesting embodiment, the adhesin- carrying bacterial 
cell being targeted is a cell which further comprises a gene 
coding for a gene product which, when it is expressed, has a 
killing or cell function- limiting effect in said cell, the 
expression of said gene coding for. the cell killing or cell 
function- limiting gene product being regulated in such a 
manner that the bacterial cell, when targeted, will be killed 
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or limited in its function in a pre -determined manner. The 
expression of such a "suicide" or cell function- limiting gene 
may suitably be regulated by a factor selected from the group 
consisting of a stochastic event, the presence/absence of a 
chemical compound in the location and a physical factor. As 
examples of such "suicide" or cell function- limiting genes 
providing the means of biological containment, may be men- 
tioned those disclosed in WO 87/5932 and WO 93/20211 

Furthermore, the present Designer Adhesin Technology (DAT) 
provides very useful means of obtaining colonization with 
desirable bacteria in a particular environment with the 
purpose of obtaining beneficial changes of the microbial 
flora in the environment. As one example, certain bacterial 
species in the gastrointestinal (GI) tract of humans and 
animals have beneficial effects on the health condition of 
the host organism e.g. by suppressing pathogenic organisms or 
by contributing to the digesting of certain diet components. 
The present technology makes it possible to select particu- 
larly useful bacteria from the GI- tract and have them 
designed in accordance with the present invention, to have 
improved colonization abilities. Similar examples include 
desirable bacterial colonizations of biological sewage puri- 
fication systems, plants where invasion of pathogenic organ- 
isms may be controlled by colonizing the plants with harmless 
bacteria, and teeth where caries may be controlled by coloni- 
zing the dental enamel with bacteria suppressing those caus- 
ing the caries attacks. 



In another industrially interesting aspect, the invention 
provides the means of isolating a compound from a solution or 
30 suspension containing the compound. The method comprising 
contacting the solution or the suspension with a fusion 
protein as defined herein wherein the heterologous 
polypeptide has an affinity to the compound to be isolated. 

Furthermore, the invention provides a composition comprising 
35 a population of a bacterial cell as defined herein. 
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The invention is further illustrated in the below Examples 
and the Figures, wherein 

Fig. 1 is a schematic model for the construction of recom- 
binant plasmids pGBl-24 (containing fimH from CI #10) and 
PGB2-24 (containing fimH from PC31) used for transforming E. 
coli AAEC191A( P PKL114) with cloned fimH genes. Plasmid P GB2- 
24 was used as the vector for all other cloned fimH genes 
described herein; 

Fig. 2 is a restriction map of fimH genes. Five unique res- 
triction sites are present in the PC31 fimH gene. Numbers in 
parentheses following enzymes are the base pair positions of 
the cut sites. Some of these sites are found in the other 
fimH genes, as marked. Chimeric genes were produced by ex- 
changing each available restriction fragment from the other 
five fimH genes with corresponding fragments in the PC31 gene 
and then recombinant strains expressing resulting chimeric 
fimH subunits were tested for adhesion. Fragments indicated 
by boxes are those which conferred MF or MFP adhesive 
phenotypes on the chimeric genes; 

20 Fig. 3 illustrates adhesion of representative "wild- type" (A) 
and recombinant (B) M-class, MF-class and MFP-class strains 
to Mn (1), Fn (2), periodate- treated Fn (3) and to FnSpl (4). 
Strain designations given for the "wild- type" strains are 
given in AS. Strain designations KB31, KB12, KB4 , KB7, KB50 

25 and KB10, are for recombinant strains of AAEC19iA(pPKLll4) , 
which is fimH-, after transformation with plasmids that 
contain fimH*- from strains HB101 (pPKL4) , CI #12, CI #4, ci 
#7, CSH-50 and CI #10, respectively. Open columns indicate 
results when bacteria were incubated in buffer without D- 

30 mannose, while solid columns are results in the presence of 
D-mannose. Values indicated are the mean ± S.E.M. (n=4) for 
each column; 

Fig. 4 illustrates the adhesion of representative M-class, 
MF-class and MFP-class strains (CIs #12, #4 and #10, respect- 
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ively) to Fn fragments prepared by thermolysin treatment as 
described in ref. 51. Columns labelled 1-5 indicate adhesion 
to: 1) NH2-terminal 30-kDa domain; 2) the 55-kDa gelatin- 
binding domain; 3) the 110-kDa cell attachment domain; 4) the 
29-38-kDa heparin binding domains; and 5) the 20-kDa COOH- 
terminal domain. Open columns represent adhesion in the 
absence of D-mannose; solid columns represent adhesion in the 
presence of D-mannose. Mean + S.E.M. (n=4) ; 

Fig. 5 is a composite figure illustrating comparison of amino 
acid sequences of FimH adhesins and active restriction frag- 
ments of fimH genes. The published nucleotide and deduced 
amino acid sequence of the PC31 fimH gene and gene product 
(ref. 27) serve as prototype. Numbered amino acid residues 
shown above the model of the PC31 FimH represent residues 
15 that are different in other FimH subunits due to amino acid 
substitution or deletion. Standard one-letter code applies 
and residues in the other FimH sequences that are different 
are indicated. Deleted amino acids are indicated by A. It 
should be noted that residue 176 is not arginine as published 
20 previously (ref. 27) for the PC31 FimH, but proline. Regions 
of the FimH subunits conferring change in adhesive phenotype, 
highlighted in bold, were determined by functional assays 
performed on chimeras between the "classic" mannose- specif ic 
PC31 fimH gene present in HBl0l(pPKL4) and the above 
described genes. Residues predicted to be key in conferring 
receptor specificity are circled. Approximate positions of 
unique restriction sites used to create chimeras are indi- 
cated along the bottom of the model; 



25 



30 



Fig. 6 illustrates plasmid pPKL4 which is a derivative of 
PBR322 (thick line) carrying the entire fim operon (FimA-H) 
including the regulatory genes fimB and fimE (not shown) , and 
the promoter region with the SnaBl site. In this plasmid an 
8mer linker with an B^IH site was inserted in the SnaBI site 
to create pPKL83; 
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Fi 9- 7 illustrates the construction of plasmid pSMl314; the 
vector pVLT33 is a derivative of the broad host range replic- 
on RSF1010. Plasmid pPKL83 was digested with Bgrlll and pVLT 
was digested with BamHl; the two were ligated and pSMl314 was 
the resulting plasmid in which expression of the fimA-H 
cluster is under the control of the tac promoter; 

Fig. 8 illustrates plasmid pLPA22 and derivatives hereof as 
used in this study. The triangles indicate the position of 
translational stop -linkers in the fimH gene in plasmid 
PPKL115. The positions of heterologous inserts are indicated 
(black boxes) . Small triangles indicate signal -peptide encod- 
ing sectors. 



Fig. 9 illustrates plasmids pLPA29, pLPA30, pLPA36, pLPA58, 
pLPA59 and pLPA98; 

15 Fig. 10 shows immuno- electron microscopy with colloid gold 
labelling of E. coli HB101 cells containing plasmids pLPA22 
plus pPKL115 (a) , pLPA37 plus pPKL115 (b) , pLPA38 plus 
PPKL115 (c), using anti-pre-S2 monoclonal antiserum. Bar, 0.1 

fJLOl. 

20 EXAMPLE 1 

Functional heterogeneity of tvnp i f imbria! adhesina rh,e m 
minor sequence va riations among fimH genes 

1.1. M aterials and methods 

1.1.2. Reagents 



25 Yeast Mn, a polymannosylated glycoprotein isolated from 
Saccharomyces cerevisiae cell walls, was obtained from a 
commercial source (Sigma Chemical Co, St. Louis, MO, U.S.A.) 
Mannan is composed of an N- linked backbone of 01, 2 -linked 
mannopyranose units with or- linked mannopyranose side chains 

30 (ref. 38) . The majority of the carbohydrate of human plasma 
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Fn is composed of N-glycosidic complex- type biantennary 
glycans and no high mannose-type or hybrid- type N-glycans 
have been described (refs. 30, 45, 54). Human plasma Fn and 
Fn fragments were purified as described previously (refs. 5, 
15, 51, 58). Periodate treatment was performed as described 
previously (ref. 51). The synthetic peptide, FnSpl, copying 
the first 30 amino acid residues of the Fn molecule (EAQQMVQ- 
PQSPVAVSQSKPGCYDNGKHYQI) was synthesized in the Protein 
Chemistry Laboratory of the VA Medical Center, Memphis, TN 
(SEQ ID NO:2). The saccharide content of the four substrata 
was characterized using two lectins, concanavalin A (ConA) , 
well known to react with terminal and internal mannosyl 
residues, and the Calanthus nivalis agglutinin (GNA) , which 
recognizes only terminal Manal-3Man, Manal-6Man and Mano-1- 
2Man sequences (E. Y. Laboratories, San Mateo, CA) . Immobi- 
lized Mn and Fn both reacted with ConA, whereas GNA bound 
only to Mn. These results are consistent with the known 
structures of the oligosaccharide moieties of these two 
compounds. Neither lectin reacted with immobilized FnSpl. 
Periodate treatment (ref. 51) of Mn or Fn eliminated lectin 
reactivity. 



25 



1.1.3. Bacterial st rains anrt, plasm-Ma 

The CSH-50 strain (lambda" , F'araA (lac -pro) rspL thi 
f±mE::lSl) is a Cold Spring Harbor K12-derived strain (ref. 
35). The E. coli strain MG 1655 (CGSC6300; K12 derivative, 
lambda", F") and a derivative strain AAEC191A (MG1655 recA ' 
Afim were generously provided by Dr. Ian Blomfield (Bowman 
Gray University, Wins ton -Salem, NC) . AAEC191A has had the 
entire flm gene cluster deleted by allelic exchange (ref. 8). 

30 Clinical isolates (CIs) were urinary tract isolates obtained 
from the clinical microbiology laboratories of the Memphis VA 
Medical Center or The City of Memphis Hospitals, Memphis, TN. 
The 12 CIs used in this study were selected on the basis of 
MS agglutination of yeast cells after growth in broth, a 

35 classic test for type l fimbriae. 
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Plasmid pPKL4, a pBR322 derivative containing the entire fim 
gene cluster from E. coll strain PC31 (K12 -derivative, gal 
toriA phx ara) and encoding for the expression of fully func- 
tional type 1 fimbriae in HB101 (supE hadS recA ara proA lacY 
5 galK rspL xyl mtl AfimBE) , has been described previously 
(ref . 28) . pPKLH4 is a recombinant plasmid derived from 
pPKL4, but with a translational stop- linker inserted into the 
Kpnl site in the flmH gene. No transcriptional effects of the 

stop- linker are to be expected. Antibiotics were used at the 
10 following final concentrations: ampicillin (50 /xg/ml) , 
kanamycin (60 fig /ml) and chloramphenicol (30 /xg/ml ) . 



1.4. Polymera se chain react 



ion 



Oligonucleotide primers were designed using the published 
sequence for the fimH gene in pPKL4 (ref. 27) . The 5' primers 
15 copied regions 13 and 49 bp upstream from the flmH gene and 
were extended on the 5' end by an Apall restriction site and 
a GC clamp: Primer 1: 5 ' -GGGGG-GTGCAC-ACC TAC AGC TGA ACC 
CGG-3' (SEQ ID NO:3); Primer 2: 5 ' -GGGG GTGCAC T CAG GGA ACC 
ATT CAG GCA-3' (SEQ ID NO: 4). The 3' primers copied 18 bases 
20 of the bottom strand of the flmH gene that encode for the 6 
terminal amino acids of fimH and were extended by an Fspl or 
Spill site and a GC clamp: Primer 3: 5'-GGG TGCGCA TTA TTG 
ATA AAC AAA AGT CAC - 3' (SEQ ID NO: 5); Primer 4: 5'-GGG 
GCATGC TTA TTG ATA AAC AAA AGT CAC- 3' (SEQ ID NO: 6) . Primer 1 
25 and 3 were used for CI #10 and pPKL4, primer l and 4 were 

used for CI #4 and CSH-50 and primer 2 and 4 were used for CI 
#s 7 and 12 to generate PCR products from plasmid or 
chromosomal DNA prepared from E. coli expressing different 
functional classes of type 1 fimbriae. The PCR reaction 
30 mixture consisted of template DNA, primer pairs, dNTPs, and 
Taq DNA polymerase in PCR buffer. The PCR was performed in a 
Perkin- Elmer Cetus automatic thermal cycler with denaturation 
at 96°C for 1 min., primer annealing at 55°C for 1 min., and 
primer extension at 72 °C for 2 mins. for a total of 40 
35 cycles. All of the PCR products migrated similarly in agarose 
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gels. Purification, restriction and ligation of DNA was 
performed using standard procedures (refs. 39, 48). All 
primers for PCR and for nucleotide sequencing were produced 
by the Molecular Resources Center, UT, Memphis. 

5 1.1.5. Snhrl oning 

The PCR products from CI#10 and from pPKL4 were cut with 
respective restriction enzymes and ligated into the Apali and 
Fspl restriction sites of plasmid pACYC177 (New England 
Biolabs, Beverly, MA, U.S.A.) which is compatible with the 
10 pBR322 -based pPKLll4 to be used in complementation experi- 
ments, creating plasmids pGBl and pGB2, respectively (Fig. 
1) . However, it became inconvenient to use pACYC177 -based 
plasmids because of a high frequency of appearance of sponta- 
neous Km r in the AAEC191A host strain. The origin of this 
problem is not entirely clear, but it was avoided by subclon- 
ing the fimH genes from pGBl and pGB2. The inserts and 
upstream regions of P ACYC177 containing the tet promoter were 
cut from pGBl and pGB2 with Fspl and BamHl and subcloned into 
the polyl inker site of pGEM-32 (Promega, Madison, WI) that 
had been cut with BamHl and Hinc2, creating plasmids pGBH 
and pGB2-i respectively. pGEM-3Z was simply used as a con- 
venient intermediate in subcloning into pACYC184. 

The inserts were cut out again using final and Hind3 and 
subcloned into pACYC184 (New England Biolabs, Beverly, MA) 
25 cut with Hinc2 and tfind3, creating plasmids pGBl-2 and pGB2- 
24 containing the fimH genes from CI#10 and pPKL4, respect- 
ively. These plasmids complement the non- adhesive defect of 
AAEC191A( P PKL114) giving the adhesive phenotypes of the 
parental strains (see Results) . Cutting the fimH gene from 
PGB2-24 using Apall and Sphl makes it possible to easily 
insert other fimH genes obtained by amplifying chromosomal 
DNA of other isolates by PCR. All recombinant strains we have 
tested thus far using this technique exhibit the same adhes- 
ive phenotype as the parent strains from which the fimH genes 
35 were cloned. 



20 
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1.1.6. Consfruetinn of chimeric fimH creneg 

Unique restriction sites (Fig. 2) were used to construct 
chimeric fimH genes between the prototypical MS pPKL4 fimH 
gene, used as genetic background, and restriction fragments 
obtained from the newly described fimH genes. Fragments were 
purified from agarose gels and ligated into restriction 
"spaces- generated in the pPKL4 fimH gene present in pACYCl'84 
(PGB2-24). Each chimera was analyzed by restriction mapping 
and the nucleotide sequences of bridging segments were deter- 
mined to ensure proper constructions. The plasmids containing 
chimeric fimH genes were transformed into AAEC191A(pPKLll4) 
and clones were tested for agglutination of yeast cells and 
for adhesion to Mn, Fn and FnSpl. 

1.1.7. Nucle otide sequencing 

The nucleotide sequences of fimH genes were determined by the 
dxdeoxynucleotide chain termination method of Sanger (ref . 
49) using a Sequenase II® kit (U.S. Biochemical Corp., Cleve- 
land, Ohio) and [or-^sidATP (800 tQ 1000 ci/mmol) according 
to the manufacturer's suggestions. The amino acid sequences 
were deduced from nucleotide sequences using MacVector® 
protein and DNA analysis software (Eastman Kodak, Rochester 
NY) . To ensure fidelity of the PGR amplification, selected 
fimH genes were re-amplified, cloned, tested for activity and 
re-sequenced. More recently, we have used the £mol» Polymer- 
ase Sequencing System (Promega, Madison, WI) , because it is 
useful with small amounts of DNA and thus subcloning the fimH 
genes from the pACYCl 84 -based plasmids to high copy number 
plasmids was obviated. Bands were visualized by autoradio- 
graphy of sequencing gels and compared with the published 
fimH gene sequence (ref. 27) . 



1,1,8. Yeast cell acrq-r eaa t- i on ac.o=.y 
E 



coli were tested for their ability to aggregate yeast 
cells. Commercial baker's yeast, Saccharomyyces cerevisiae, 
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was suspended in PBS (5 mg dry weight /ml) . E. coli were 
washed in PBS, resuspended to an OD 530 of 0.4, and mixed with 
the yeast cell suspension in PBS with or without 1% D- 
mannose. Aggregation was monitored visually and the titer 
recorded as the last dilution giving a positive aggregation 
reaction. 

1.1.9. Adhegjon assays 
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20 



Adhesion assays were performed as described previously (ref . 
51) . Briefly, microtiter assay wells were coated with 
receptor molecules as indicated in the text and figure leg- 
ends. After the wells were washed two times with PBS, 100 /xl 
bacterial suspensions were added in 0.1% BSA-PBS. After 
incubation at 37°C for indicated times, wells were washed 
three times with PBS and adherent bacteria were detected by 
15 using rabbit anti-E. coli serum. Antibody binding was 

detected using peroxidase -conjugated goat anti-Rabbit igG. 
Reaction product generated from the 5 -aminosalicylic acid 
substrate was measured at 405 ran after 10-15 minutes by using 
an automatic microplate reader (Molecular Devices, Inc., 
Menlo Park, CA) . Values reported are corrected for background 
reaction using BSA coated plates as control. 

1.2. Results 

In a previous publication it was reported that type 1 
fimbriae of E. coli CSH-50 and HB101 (pPKL4) differ func- 
tionally in their pattern of adhesion to Mn, Fn, periodate- 
treated Fn and a synthetic peptide, FnSpl, immobilized on 
plastic (ref. 51). Since CSH-50 and HBl01(pPKL4) are labora- 
tory strains, we tested 12 clinical E. coli isolates (CIs) 
obtained from human urine for adhesion to these four substra- 
ta. All of the CIs agglutinated yeast cells in a MS fashion. 
Five of the twelve CIs adhered only to Mn. The adhesive 
activity of HBl01(pPKL4) and of CI #12 are shown as examples 
of this class, which we have tentatively designated as M 
class (Fig. 3 A) . Three of the 12 CIs adhered to Mn and Fn, 
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but not to periodate- treated Fn or to FnSpl. The adhesive 
activities of CI #s 4 and 7 are shown as examples of this 
class, designated as MF class. Three of the twelve CIs 
adhered to each of the substrata. The adhesive activities of 
CSH-50 and CI #10 are shown as examples of this class, desig- 
nated as MEP class. 



10 



Adhesion of strains representing these three classes to Fn 
fragments further illustrates the distinct differences 
between the three classes. The M class CI #12 does not adhere 
to any of the Fn fragments (Fig. 4) . The MF class CI #4 
adheres to the 40-kDa gelatin-binding fragment. The MFP class 
CI #10 adheres, with only slight differences, to all 5 frag- 
ments of Fn tested. Periodate treatment eliminated binding of 
CI #4 to domain 2, but had no effect on the binding of CI #10 
15 to any of the Fn domains (data not shown) . 

Since the fimH subunit has been shown to mediate the mannose- 
sensitive activity of type 1 fimbriae, we focused our initial 
efforts to elucidate the molecular basis for the observed 
functional heterogeneity on the fimH gene. fiM genes were 
20 amplified from chromosomal (or plasmid, for pPKL4) DNA and 
the genes were cloned into pACYCl77 and subcloned into 
PACYC184 under control of the 0- lactamase promoter of 
PACYC177, according to Materials and Methods (Fig. l) 



25 



The adhesive phenotypes conferred by the fimH genes were 
tested in the following way. E. coli K-12 strain AAEC191A 
(Afim) was first transformed with plasmid pPKL114, which 
contains an intact fim gene cluster but with a translational 
stop- linker inserted into the last gene, fimH. This deriva- 
tive produces morphologically normal fimbriae that are non- 
30 adhesive due to absence of the FimH subunit. Plasmids har- 
bouring cloned fimH genes were transformed into E. coli 
AAEC191A( P PKL114) and the resultant strains were tested for 
their ability to adhere to Mn, Fn f periodate- treated Fn and 
to FnSpl (Fig. 3B) . Each of the recombinant strains displayed 
adhesive phenotypes mimicking those of the representative 
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parent strains from which the fimH genes were obtained. fimH 
genes were cloned from each of the other 8 CIs and similar 
results were obtained with the adhesion of recombinant 
strains mimicking that exhibited by the parental CIs. 

The complete nucleotide sequences of each of the six repre- 
sentative fimH genes were determined and the amino acid 
sequences of the fimH proteins were deduced as it is shown ir 
Table l below which is a representation of amino acid 
sequences of the FimH subunits deduced from nucleotide 
sequences of selected fimH genes disclosed in this example 
[CI#3 (SEQ ID NO:33), CI#4 (SEQ ID NO:29), CI#7 (SEQ ID 
N0:30), CI#10 (SEQ ID N0:3l) and CI#12 (SEQ ID NO:28)] and 
those of the E. coli K12 strain PC31 (SEQ ID N0:1) and E. 
coli strain CSH-50 (SEQ ID NO:32). Additionally, the FimH 
amino acid sequences of the following clinical isolates of E. 
coli are shown; KB21 (SEQ ID N0.-27) , KS54 (SEQ ID NO:35) , 
U221-3 (SEQ ID NO:36), MJ#9-3 (SEQ ID NO:37), MJ#31-3 (SEQ ID 
NO:38), MJ#ll-2 (SEQ ID NO:39), MJ#2-2 (SEQ ID N0:1) and F-18 
(SEQ ID NO:34) . Standard one- letter code applies. Deleted 
amino acid residues are indicated by As. M, M L , MF, MFP, and 
M R indicate the adhesin class as defined above. 
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Table 1, Amin o acid sequences of the FimH proteins deduced 
from nucleoti de sequences of fimH genes of clinical isolates 
disclosed in t his example and of E. coll K12 strains PC31 and 
CSH-50 
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and gene products should also be identical, subunit incorpor- 
ation into the fimbrial superstructure should not vary sig- 
nificantly. These results emphasize that in these experiments 
it is the FimH subunit that determines receptor specificity. 

5 In comparing the new FimH sequences to the one published 

previously (ref. 27), the only structural alteration that can 
be clearly linked to a functional change, without resorting 
to analysis of chimeric fimH genes, is the non- conservative 
substitution of arginine 58 in the MFP class CSH-50 FimH 
10 subunit for leucine 58 in the M class PC31 FimH subunit. Since 
each of the other FimH sequences had more than one change, it 
was necessary to construct chimeric genes to begin to focus 
on functionally relevant changes. 

In the case of the CI #10 FimH, an MFP class adhesive activ- 
15 ity similar to that of CSH-50 is conferred by a different 
region of the gene which encodes for a subunit deleted of 
residues 116-119. It remains to be determined how two dis- 
tinctly different structural changes can bring about appar- 
ently similar changes in receptor specificity. It is pos- 
20 sible, of course, that as additional receptor molecules are 
tested, these two variants will be found to be functionally 
distinct. 

The Apall-Tthlliz fragment of the CI #7 fimH gene confers MF 
class activity in the CI#7/PC3l fimH chimera. Since the 

25 asparagine 16 - threonine 16 substitution is within the leader 

sequence and thus not represented in the mature protein, the 
histidine 33 - asparagine 33 substitution must be of functional 
importance for the MF class CI #7 FimH. Comparison of the 
active regions of the MF class CI #4 and the M class CI #12 

30 FimH subunits suggests the importance of the glutamic acid 73 - 
glycine 73 substitution for MF class activity of the CI #4 
FimH. Thus, histidine 33 , arginine 58 , glutamic acid 37 and 
deleted glycine 116 - isoleucine 119 appear to be key residues in 
the functional activity of FimH subunits of CI #7, CSH-50, CI 

35 #4 and CI #10, respectively, but a more precise demonstration 
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48 

of which residues are involved and how they affect the 
ligand- binding cleft (s) remains to be performed. 

At first glance, the FimH mediated, mannose- sensitive pro- 
tein-binding activity of type 1 fimbriae is the most surpris- 
ing of the adhesive phenotypes described here. However, 
protein- binding activity of FimH (i.e. PilE) subunits was 
noted earlier in a study characterizing mutT- induced muta- 
tions in the fizaH (pilE) gene (Harris et al., ref. 22) How- 
ever, the protein- binding activity described by Harris et al. 
was not mannose- sensitive. It is presently not known whether 
the protein-binding activity described herein is in addition 
to or separate from the mannose -binding activity, but the 
concept of bifunctional properties of lectins has been estab- 
lished for several years (ref. 6). While the MFP class type l 
fimbriae appears to react somewhat promiscuously with most Fn 
fragments, the reaction does not appear to be non-specific. 
For instance, the MFP class CSH-50 type 1 fimbriae do not 
adhere well to gelatin (ref. 51) . Further, the adhesion to 
ovalbumin is sensitive to both periodate and glycosidase 
20 treatment (ref. 51). Further work will be required to deter- 
mine the consensus amino acid motif reactive with this class 
of FimH subunit. 

Previous studies suggested that the combining site of the E. 
coli FimH adhesin is in the form of an extended pocket cor- 
25 responding to the size of a trisaccharide with an associated 
hydrophobic region (ref. 16). The MS nature of all of the 
adhesive interactions described suggests that if the combin- 
ing sites are separate, they may be close to each other. 
However, it remains to be determined whether or not the 
mannose effect is direct or allosteric. Conformational 
changes that frequently occur in lectins upon binding the 
saccharide ligand (ref. 46) could affect a second, distant 
binding site. Site-directed mutations may be sufficient to 
clarify which structural changes result in changes in 
receptor specificity. However, such studies are unlikely to 
shed much light on how the structural changes actually relate 
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Table 1, continued 
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The nucleotide and deduced amino acid sequences of the pPKL4 
fimH gene are identical to that reported previously, except 
that residue 176 is not an alanine residue as previously 
reported, but a proline residue. Independent re-amplifica- 
tion, re- cloning and re - sequencing confirmed this sequence 
for the pPKL4 fimH gene. Sequencing was also repeated on 
independently amplified and cloned isolates of the CI #10 and 
CI #7 fimH genes to confirm sequence fidelity and no errors 
were found. 



The nucleotide and deduced amino acid sequences of the other 
fimH alleles described in this Example are > 98% conserved, 
but there is more than one amino acid residue difference in 
all but one of the new fimH sequences when compared to the 
published P PKL4 sequence. To focus on the sequence differ- 
ences that resulted in changes in functional activity, advan- 
tage of unique restriction sites were taken (Fig. 2) to 
construct chimeric fimH genes. Multiple restriction fragments 
covering the entirety of each of the sequenced fimH genes 
were exchanged with corresponding fragments in the prototypi- 
cal fimH gene of E. coli strain PC31 that was amplified from 
PPKL4, cloned into pACYC184 and used as the genetic back- 
ground. Recombinant plasmids containing the chimeric fimH 
genes were transformed into E. coli AAEC19lA(pPKLll4) and 
transformants were tested for adhesive phenotype, allowing 
25 determination of the regions of each gene capable of confer- 
ring functional activity (Fig. 5 ) . All of the sequence 
changes that affected function occurred between residues 33 
and 119 of the 279 residue mature fimH sequence. 



1.3. Discussion 



The functional heterogeneity which is described above must be 
due entirely to allelic variants of the fimH gene. The only 
variables in the recombinant strains which are described in 
this Example are the fimH genes; all other genes necessary 
for fimbrial subunit synthesis, transport and assembly are 
35 the same in each case. Since the ratios of the various genes 
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to the ligand- binding cleft (s) and it will ultimately be 
necessary to determine the 3 -dimensional structure of FimH or 
FimH fragments crystallized in the presence of ligand to 
fully understand structure/function relationships. 

The three adhesive classes of type l fimbriae identified 
above may understate the functional heterogeneity of type l 
fimbriae. The group of CIs that has been tested in this 
Example is small and only a few substances have been tested 
as potential receptors. A larger group of isolates tested 
against additional receptor candidates might yield additional 
functional classes. Preliminary studies with MS Enterobacter 
aerogenes and Klebsiella, pneumoniae strains exhibiting MF 
class and MFP class activity suggest that heterogeneous 
receptor specificities will also be found among other type l 
IS fimbriated enterobacterial species. 

It is also believed that it is possible that adhesins from 
some fimbriae responsible for mannose- resistant hemagglutina- 
tion or adhesion are structurally related to FimH, but with 
sequence alterations that eliminate sensitivity to mannose. 
The possibility that the MS lectin-like properties of FimH 
might be eliminated while retaining other adhesive properties 
of FimH (e.g. pellicle formation) has been shown previously 
in a study characterizing mutT- induced mutations in the fimH 
(pilE) gene (ref. 26). At the minimum, it is believed that 
tests for type l fimbriation should include additional func- 
tional characterization. While all type 1 fimbriae -mediated 
adhesion which have been described in this Example is 
mannose- sensitive , it is not all mannose- or even saccharide- 
specific as has commonly been thought. Further studies of 
type l fimbriae as a virulence factor must be able to distin- 
guish among the various functional classes. 
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30 



Allelic variation of the so-called G adhesins of P fimbriated 
uropathogenic E. coli also results in different functional 
classes, but the requirement for the Galal-4Gal sequence 
35 within isoreceptors is maintained (refs. 52, 53). These 
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differences in G adhesin receptor specificity appear to be 
rather subtle, at least superficially, when compared to the 
differences in FimH receptor specificities. Yet there is 
significantly greater sequence homology among the fimH genes 
than among the G adhesin genes, some of which share less than 
50 percent homology. The G adhesin receptor specificities 
affect host susceptibility, due in large part to host -speci- 
fic expression of glycolipid isoreceptor variants. Whether- 
the FimH family of adhesins bears a similar relationship to 
host susceptibility or tissue tropism remains to be deter- 
mined. In this regard it is possible that the G adhesin 
family could exhibit additional receptor specificities not 
restricted to the Galorl-4Gal sequence. The lectin- independent 
affinity of P fimbriae for immobilized Fn is not dependent on 
15 the G adhesin, but on two other minor subunits, E and F, 

neither of which bear significant homology to FimH (refs. 56, 
57) . 

It is important to point out that the degree of functional 
heterogeneity of type 1 fimbriae described in the present 
Example was not appreciated when any of the studies cited 
above were performed. The results of these studies have made 
it clear that structural and functional heterogeneity occurs 
within the class of adhesive organelles commonly referred to 
as MS or type 1 fimbriae and that the adhesive diversity will 
25 lead to a broader spectrum of receptive surfaces for poten- 
tial colonization. The surprising finding that a FimH family 
of adhesins exists may prove to be an important step toward 
unravelling the role(s) type 1 fimbriae may play in the 
ability of enterobacteria to reach their normal habitat or 
gain entry into deeper tissues, where devastating effects can 
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occur. 
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EXAMPLE 2 
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Expression of type 1 fimbriae in heterologous bacterial 
species 

The flm operon of E. coll comprises a cluster of genes 
covering about 8 kb of DNA. This operon has been isolated and 
cloned on plasmids in its entirety. The promoter upstream of 
the flmA gene is located within an invertible DNA sequence, 
which in E. coll leads to a switch on/switch off situation 
for fimbrial synthesis. In one orientation of the invertible 
sequence the promoter is directed towards the flm genes, and 
the cell is fimbriated; in the other orientation the promoter 
is directed in the opposite direction, and the cell is non- 
f imbriated. 

Since the regulation of the switch of the invertible promoter 
sequence is very complex and involves several genes outside 
the flm operon it is far from certain that the switching 
takes place in other bacteria than the enterics. It was 
therefore considered necessary to insert a replacement promo- 
ter for the expression of the flm genes, and as a model for 
gene expression in a number of different bacterial species 
the lac promoter was chosen. This promoter has been shown to 
be active and regulatable in many bacterial species. 

Plasmid pPKL83 is a derivative of pPKL4 (ref . 27) carrying 
the entire flm operon in pBR322, in which the promoter has 

25 been destroyed by inserting a BglZl linker in the SnaB site 
located in the promoter sequence. There is a second B&1TT. 
site in plasmid pPKL83 upstream of the flm operon (Pig. 6) . 
Plasmid pVLT33 (Fig. 7) is a kanamycin resistant derivative 
of the broad host range plasmid RSF1010, carrying the lacl* 

30 gene and the tac promoter placed upstream of a multiple 

cloning site in which a unique SamHl site is placed. The two 
plasmids were ligated together after digestion of pPKL83 with 
Sglii and pVLT33 with BamHl. In one orientation (pSMi3l4) , 
this fusion plasmid will express fimbriae in the presence of 
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IPTG due to the fusion between the fimbrial genes and the lac 
promoter. 

The correct orientation of the fusion plasmid pSMl3l4 was 
verified by transforming it into a strain of E. coli which 
i carries a deletion of the fim operon. Production of fimbriae 
was assayed in two ways: 1) Cell aggregation with fimbrial 
antibodies and 2) ELISA assay of whole cells. The former 
analysis is rather simple: to a small volume (10 fil) of an 
outgrown or IPTG- induced culture of the cells to be tested is 
added a small volume (2 fil) of antibodies raised against 
fimbriae, on a glass slide. After mixing the samples, 
fimbriated cells begin to show cell aggregates which are 
easily observed directly as clumps or under a microscope. No 
clumping was observed with cells of the strain with a fim 
deletion, whereas pSM1314 transf ormants of this strain showed 
clearly detectable cell aggregates. The ELISA analysis of 
whole cells confirmed the aggregation assay. In Table 2 below 
the readings from this type of assay are presented, and they 
show quantitatively the occurrence of Fim antigens on the 
cells as a result of IPTG induction of the pSM1314 carrying 
strain. 



Table 2. Results (duplicate of ELISA assay for tvp» i 
fimbria expressed by PSM1314 -in e . col -i AAEC191 fr>n l jr ) 

0.145/0.164 
1.026/1.260 
0.113/0.095 

Plasmid pSMl3i4 also carries a mob site which allows it to be 
transferred to other gram negative bacteria provided a helper 
plasmid is introduced. This type of transfer is most easily 
30 performed in - triparental " matings in which a donor strain 

(E. coli carrying P SM1314) , a helper strain (E. coll carrying 
a plasmid with conjugation genes) and a recipient strain 
carrying a selectable marker not present in any of the two 



AAEC19 1 (pSM13 14 ) 
AAEC191 (pSM13 14 ) +IPTG 
Blank: 
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other strains, are mixed on a plate (directly or on a fil- 
ter) . After some growth (often overnight) this mixture is 
spread on selective plates with antibiotics that only allow 
the recipient carrying the desired plasmid to grow and form 
5 colonies. 

In the present context, the E. coli strains MC1000 (pSM13l4) 
and MC1000 (pRK2013) and (as recipient) finteroiiacter cloacae 
strain A50 Nal r (ref. 67), were mated. This recipient strain 
is resistant to nalidixic acid. After selection for growth on 
10 plates with kanamycin plus nalidixic acid the resulting 

clones were grown in liquid medium and assayed for the pres- 
ence of fimbriae in the absence/presence of IPTG. The cell 
aggregation assay was employed. 
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This assay showed that fimbriae were produced in the Enterob- 
acter cloacae strain and were present on the cell surface; 
however, full repression of expression from the tac promoter 
was not obtained, most likely due to an increased escape 
synthesis. The results showed that E. coll type l fimbriae 
may be synthesized and processed correctly for pili formation 
on the surfaces of heterologous gram- negative bacterial 
species . 

The plasmid P SM1314 in E. coli HB101 was deposited on 26 
January 1994 with DSM, the Deutsche Sammlung von Mikroor- 
ganismen und Zellkulturen GmbH, (German Collection of Micro- 
organisms and Cell Cultures) , Mascheroder Weg IB, D- 38124 
Braunschweig, Germany, under the accession number DSM 8922. 



EXAMPLE 3 

The construction of fimH- fusion gen^ and the expression nf 
mannoae- s ensitive FimH fusion proteins 

30 Heterologous sequences mimicing the pre-S2 region of the 

hepatitis B viral surface antigen and a neutralizing epitope 
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of the cholera toxin B chain were inserted in two different 
positions in the FimH adhesin of type 1 fimbriae. This was 
carried out by introduction of restriction site handles 
{Bgl II -sites) in the fimH gene, followed by in- frame inser- 
5 tion of heterologous DNA segments encoding the foreign epito- 
pes. In the selected positions such insertions did not sig- 
nificantly alter the adhesive function of the FimH protein, 
since hosts producing hybrid fimbriae that contained the 
chimeric adhesins exhibited adhesion phenotypes and were 

10 normally fimbriated. The heterologous inserts of 52 and 15 
amino acids, respectively, residing in the chimeric FimH 
proteins were recognized by specific sera on the surface of 
the fimbriae on bacterial hosts. The results illustrate the 
possibility of using bacterial adhesins as general presenters 

15 of foreign antigens and epitopes. 



3.1. M aterials and methods 



20 



3.1.1. Bacterial str ain and growth conditions 

The Escherichia coli K12 strain HB101 was used in this study 
as a host for expression of chimeric fimbriae. This strain is 
phenotypically Fim" due to a deletion in the fim gene cluster 
(ref . 8) . Cells were grown on solid medium or in liquid broth 
supplemented with appropriate antibiotics. When required, 
gene expression from the lac promoter, residing in front of 
the fimH gene in plasmid pLPA22 and its derivatives, was 
25 induced by the addition of IPTG (isopropyl thiogalacto- 
pyranoside) to the growth medium. 

3.1.2. Plasmids 



Plasmids pPKL4 (comprising the entire, functional fim gene 
cluster) and pPKL114 (comprising the fimH gene) have been 
30 described previously. 

PPKL115 which is a plasmid containing the entire type 1 fim 
gene cluster with a stop linker insertion in the fimH gene 
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(i.e. this plasmid expresses all the proteins necessary for 
the production of fimbriae except the FimH protein) was 
constructed in two steps : 

(i) plasmid pPKL4 (refs. 27, 28) was digested with JCpnl which 
5 recognizes a unique restriction site in the fimH gene. The 

staggered end of the linearized plasmid was made blunt and 
ligated with the synthetic piece of DNA below (SEQ ID NO: 7) 
containing stop codons in all three reading frames, resulting 
in plasmid pPKL114: 

1 0 5 ' - GTCGACTTAATTAATTAAGTCGAC - 3 ' 

3 ' - CAGCTGAATTAATTAATTCAGCTG - 5 ' ; 

(ii) the Hirrdlll-BagrI fragment from pPKL114 # containing the 
entire fim gene cluster with the inactivated fimH gene was 
subsequently inserted into the tfindlll and EagI sites of 

15 plasmid pACYC184, resulting in plasmid pPKLH5. 

Plasmid pSM782 (generously provided by S. Molin, Department 
of Microbiology, Technical University of Denmark, DK-2800 
Lyngby) containing the pre-S2 and S encoding regions of the 
hepatitis B viral genome, was made from plasmid X-HBV1 (ref . 
20 72) by subcloning a EcoRI-Dral fragment into pBR322 . 

Plasmid pLPA22 was constructed by inserting a 1018 bp PvuII- 
Mlul fragment containing the fimH gene from pPKL4 into 
plasmid pUC18. The insert was positioned downstream and in a 
expression compatible orientation to the lac promoter resid- 
25 ing on the vector part of the plasmid (Fig. 8). Expression in 
E. coli HB101 cells of functional FimH protein was monitored 
by complementing pLPA22 in trans with pPKLllS and testing for 
MS adhesion upon induction with IPTG. 

Plasmids pLPA29 and pLPA30 were made by inserting 9-mer 
30 asymmetric Bglll- linkers into the BsaAl and Hindi sites, 
respectively, in the fimH gene of plasmid pLPA22. At six 
different positions in the pLPA22 fimH gene a Bglli site was 
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introduced without changing the reading frame, resulting in 
plasmids pLPA98, pLPA36, pLPA58, pLPA30, pLPA29 and pLPA59 
(Fig. 10) . This was done either by inserting a B^lII linker 
into an appropriately treated restriction enzyme site, or by 
5 changing 1-3 basepairs using PCR and thereby creating a Bglll 
site. 

The plasmid pLPA36 was prepared by opening the pLPA22 fiznH 
gene with the restriction enzyme Tthllll and making the ends 
blunt using Klenow polymerase and ligating using an 8 mer 
10 Bgltt. linker (SEQ ID NO: 8) : 



5 ' - CAGATCTG- 3 ' 
3 ' - GTCTAGA C-5' 

Plasmids pLPA58 and pLPA59 were made by BglTL site- creating 
site -directed mutagenesis of pLPA22 using standard PCR and 
15 plasmid pLPA98 was constructed by opening the fimH gene, 
making the ends blunt with T4 DNA polymerase and ligating 
with the below 10 mer B^III linker (SEQ ID NO: 9) : 

5 ' - GAAGATCTTC - 3 ' 
3 ' - C TTCTAGAA G - 5 ' 

20 Of the six resulting mutated fimH genes, three expressed 

protein that was integrated into type 1 fimbriae, and at the 
same time exhibited mannose- sensitive adhesion. Of these 
three mutated FimH proteins, the two that conferred to E. 
coli cells the strongest mannose- sensitive adhesion were 

25 expressed from plasmids pLPA29 and pLPA30 (Fig. 9) and these 
two plasmids were investigated further for their ability to 
contain large mutations and still be biological active. 

Plasmid pLPA29 has a 9 bp long symmetrical B^III linker 
inserted into the BsaAl site 66 bp upstream of the stop codon 
30 for the fimH gene, while plasmid pLPA30 has the same 9 bp 

Bgrtll linker inserted into the Hindi site 163 bp upstream of 
the stop codon of the fimH gene. 
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The plasmids pLPA37 and pLPA38 (Pig. 8) were constructed by 
inserting a 162 bp DNA fragment encoding the pre-S2 region of 
the Hepatitis B virus surface antigen into the Bglll sites in 
PLPA29 and pLPA30, respectively. This DNA fragment was crea- 
ted by a standard polymerase chain reaction (PCR) using the 
synthetic primers: (i) 5 ' -GGAGATCTAATTCCACAACCTT-3 ' (SEQ ID 
NO: 11) and (ii) 5 ' -GGAGATCTGTTCAGCGCAGGGT-3 ' (SEQ ID NO: 12), 
and plasmid pSM782 as a template. 

A fragment of plasmid pLPA38 comprising the inserted 
heterologous sequence encoding the pre-S2 region of hepatitis 
B surface antigen is shown in the below table wherein the 
heterologous sequence is underlined and the numbers indicated 
correspond to the positions of the amino acid residues in the 
mature FimH protein. 

Bgl II 

CAG TTC AGA TCT AAT TCC ACA ACC TTC CAC CAA ACT CTG CAA GAT 
Gin Phe Arg Ser Asn Ser Thr Thr Phe His Gin Thr Leu Gin Asp 



224 



CCC AGA GTG AGA GGC CTG TAT TTC CCT GCT GGT GGC TCC AGT TCA 
Pro Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser 



GGA ACA GTA AAC CCT GTT CTG ACT ACT GCC TCT CCC TTA TCG TCA 
Gly Thr Val Asn Pro Val Leu Thr Thr Ala Ser Pro Leu Ser Ser 



ATC TTC TCG AGG ATT GGG GAC CCT GCG CTG AAC AGA TCT TCG ACG 
lie Phe Ser Arg lie Gly Asp Pro Ala Leu Asn Arg Ser Ser Thr 



226 



The plasmids pLPASS and pLPA93 (Pig. 8) were then made by 
inserting the below 51 bp synthetic double stranded DNA 
segment encoding amino acids 50-64 (comprising an epitope) of 
the B subunit of the cholera toxin into the Bgll sites on 
PLPA30 and pLPA29, respectively (SEQ ID NO: 10): 
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5 ' - GATCTGTTGAAGTTCCGGGTAGTCAGCATATCGATAGTCAGAAAAAAGCTG - 3 



3' - 



ACAACTTCAAGGCCC^TCAGTCGTATAGCTATCAGTCTTTTTTCGACCTAG- 5 



A fragment of plasmid pLPA93 comprising the heterologous 
sequence encoding the above DNA segment of the B subunit of 
the cholera toxin is shown in the below table wherein the 
heterologous sequence is underlined and the numbers indicated 
correspond to the positions of the amino acid residues in the 
mature FimH protein. 



rt° H C t GA I CT T"" GAA GTT CCG GGT AGT CAG CAT ATC GAT AGT 
Gto Phe A '9 Ser Val Glu Val Pro Gly Ser Gin His lie Asp Ser 



BamH MBgl II 
CAG AAA AAA GCT GGA TCT TCG ACG 
Gin Lys Lys Ala Gly Ser Ser Thr 
— 226 
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3.1.3. DNA r- Qchniquf »=i 

Isolation of plasmid DNA was carried out according to the 
method of Birnboim and Doly (ref. 73). Restriction endonu- 
cleases were used according to the manufacturer's specifica- 
tions (Biolabs) . DNA sequencing was carried out by the di- 
deoxy chain termination technique (ref. 49) using a sequenase 
version 2 . 0 kit from USB. Oligonucleotides were made at the 
core facilities of the Department of Microbiology, Technical 
University of Denmark. 

3.1.4. PCR mpf-hodoloery 

Polymerase chain reactions (PCR) were performed on a Perkin 
Elmer Cetus DNA Thermal Cycler 480. Reactions were set up as 
100 /xl volumes containing 200 /iM each of dATP, dCTP, dGTP and 
dTTP, 0.2-1.0 fOi of each of the two primers. 2 mM MgCl 2 , io 
mM Tris-HCl ( P H 8.3), 50 mM KC1, 2.5 units of AmpliTaq DNA 
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polymerase and 0.1-0.2 M g of plasmid template. The reactions 
were run for 25-30 cycles each consisting of l min. at 94°C, 
1 min. at 40°C, and 1 min. at 72°C. For amplification of the 
pre-S2 fragment the above primers 5 ' GGAGATCTAATTCCACAACCTT 3' 
(SEQ ID NO: 11) and 5 ' GGAGATCTGTTCAGCGCAGGGT 3'(SEQ ID NO: 12) 
were used. 

3.1.5. Hemaggl utination 
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20 



25 



The capacity of bacteria to express a D-mannose binding 
phehotype was assayed by their ability to agglutinate guinea 
pig erythrocytes on glass slides. Aliquots of liquid cultures 
grown to an optical density of 3 . 0 and 5% erythrocytes were 
mixed, and the time until agglutination occurred was 
measured . 



3.1.6. Antiata-ra 



Rabbit anti-type 1 fimbria serum raised against purified type 
1 fimbriae has previously been described (ref . 74) . a 
monoclonal antibody directed against FimH (ref. 75) was 
kindly provided from Dr. Maryvonne Dho-Moulin, Institut 
National de la Recherche Agronomique, France. Goat serum 
raised against cholera toxin B subunit (international stan- 
dard for WHO No. 12-246) produced at the State Serum Insti- 
tute, Copenhagen, Denmark was kindly provided by same insti- 
tute. A monoclonal antibody directed against the pre-S2 
domain of Hepatitis B surface antigen (ref. 76) was kindly 
provided by Dr. Makoto Mayumi, Jichi Medical School, Japan. 
Fluorescein (FITC) conjugated anti rabbit, anti mouse, or 
anti goat sera were provided from Dakopats, Denmark. 

. 3.1.7. Fluorescence lab^n-ir.g and CCD microscopy 

Cells from overnight cultures (IPTG- induced, if required) 
30 were harvested, washed in PBS and fixed for. 10 minutes at 

room temperature in a 3.5% (w/v) solution of paraformaldehyde 
in PBS. Samples of 20 /jl! were placed on a poly- L- lysine 
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coated slide and air dried. After washing in PBS, 16 nl of a 
1:5 (monoclonal) or 1:25 (polyclonal) dilution of the primary 
antiserum was placed on top of each sample and left in a 
moist incubation chamber for 1 hour. The slides were washed 
three times in PBS and 16 /*1 of FITC conjugated antiserum 
were added. After two hours in the dark, the slides were 
washed three times in PBS and a drop of Citiflour (Citiflour 
Ltd., London, U.K.) was placed on top of each sample. For 
visualization, a Carl Zeiss Axioplan microscope equipped for 
epifluorescence and phase -contrast was employed. Using a 
charge -coupled device (CCD) camera, pictures were captured as 
12 -bit files with PMIS software (Photometries) and 
subsequently transferred to a Macintosh Quadra 950 computer 
for image analysis . 

15 3.1.8. Electron microscopy. 

Electron microscopy and immuno- electron microscopy was 
carried out essentially as described previously (ref . 61) . In 
brief, a 25 nl aliquot of bacterial suspension was placed on 
a carbon- coated, glow discharged grid for 30 seconds. Grids 

20 were washed in 2 drops of PBS, dehydrated for 5 min in each 

of the following concentrations of ethanol: 25%, 50%, 75% and 
96%, blotted dry and shadowed with tungsten wire at an angle 
of 30°. For immuno -electron microscopy a monoclonal antibody 
directed against the pre-S2 region was used diluted 1:5 as 

25 the primary antibody and rabbit anti -mouse serum conjugated 
with 10 nm gold particles (Dako) was used in dilution 1:20 as 
the secondary antibody. 



3.2. Results 

As described above, two positions in the C- terminal part of 
the FimH protein were engineered to contain heterologous 
sequences mimicing foreign antigenic determinants. In the 
present study, double plasmid systems were used. In each 
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plasmid pair one encoded either a wild- type or an engineered 
version of the fimH gene, whereas the second plasmid encoded 
auxiliary functions such as the two -component Fim- specific 
transport system, regulatory genes and other structural 
components of the fimbrial organelle except FimH (Table 3) . 

3.2.1. Engineering new res triction sites into flmH. 

Based on algorithms for prediction of such parameters as 
hydrophilicity and secondary structure, two potentially 
optimal positions for insertions of heterologous sequences in 
the C- terminal domain of the FimH protein were selected. 
These correspond to positions 225 and 258 in the mature 
protein predicted to be situated in a surf ace -exposed part of 
the FimH protein. In order to facilitate later manipulations, 
the fimH gene was subcloned into the pUC18 vector resulting 
in plasmid pLPA22 . Subsequently a Bglll site was introduced 
in-frame into positions 225 and 258, respectively. This was 
carried out by site-directed mutagenesis employing synthetic 
oligomers resulting in plasmids pLPA30 and pLPA29, respect - 
20 ively (Fig. 9). 
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15 



The introduced Bgrlll sites resulted in a codon change from a 
Leu to a Phe codon in position 225 and addition of codons for 
the sequence Arg-Ser-Ser, in the case of plasmid pLPA29, and 
addition of codons for the sequence Arg-Ser-Gly after posi- 

25 tion 258 in the case of plasmid pLPA30. Sequence analysis of 
the entire modified fimH genes in plasmids pLPA29 and pLPA30 
confirmed that no other changes had occurred. Host cells 
which in addition to plasmid pLPA29 or pLPA30 also contained 
plasmid pPKL115 (fimH) , showed wild- type phenotypic charac- 

30 teristics with regard to adhesion and fimbriation as judged 
by such criteria as hemagglutination (Table 3) and immuno- 
fluorescence microscopy. 
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3.2.3. Engineering het erologous DNA- sequences encoding the 
pre-S2 domain of hepatitis B surface a ntigen and a cholera 
toxin epitope into fimlj 1 

As heterologous reporter epitopes the pre-S2 region of the 
5 hepatitis B surface antigen and a well characterized region 
of the B subunit of cholera toxin were selected. The pre-S2 
region is known to contain immunologically important (and 
protective) antigenic determinants (ref. 76). In addition, 
this region is disulphide bond- independent and apparently 
10 more immunogenic than the major S protein. The cholera toxin 
segment consists of residues 50-64 of the B subunit and has 
previously been shown to elicit antibodies that bind to and 
neutralize cholera toxin (ref. 77) . 

15 A DNA segment of 162 nucleotides encoding 52 of the 55 amino 
acids of the pre-S2 region was amplified by PCR technology 
using plasmid pSM782 as template and primers that provided 
the amplified sequence with flanking Bgrlll sites. Following 
restriction with BgllX and purification the amplified frag- 
ment was inserted into the Bgln sites of plasmids pLPA29 and 
PLPA30 resulting in plasmids pLPA37 and pLPA38, respectively 
(Fig 9) . Subsequent sequence analysis confirmed that the 
inserts were correctly oriented and that the reading frame of 
the chimeric fimff-pre-S2 genes was correct. 



20 
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A synthetic DNA segment encoding the cholera epitope was made 
by annealing two complementary 51 bp oligonucleotides which 
were designed to result in a double stranded DNA fragment 
with a Bg-lll overhang in one end, a BairiHI overhang in the 
other and an internal Clal site. The epitope- encoding segment 
was inserted into the Bgrlll site in the fimH gene in plasmids 
PLPA29 and pLPA30, resulting in regeneration of a Balll site 
at only one end of the insert. This feature was used to 
identify plasmids with correct orientation of the insert. The 
presence of the Clal site was used for initial screening for 
clones containing the insert. Sequence analysis of plasmid 
PLPA93 and pLPA95, both harbouring the epitope -encoding 
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segment confirmed the orientation and conservation of the 
reading frame in the chimeric fimH- cholera genes (Fig 8) . 

3.2.4. Expression of ch imeric FimH adhesin comprising as 
heterologous seauencea th e pre-S2 domain of hepatitis n 
surface antigen and a cholera toxin epitope. 



To evaluate whether the heterologous inserts in fimH were 
compatible with protein expression the T7 polymerase/promoter 
system of Tabor and Richardson (ref . 78) was used. Subcloning 
into the pGEM3 vector system and subsequent assaying revealed 
that proteins with the expected sizes were produced in all 
cases from the chimeric fimH genes. More importantly, to 
assess whether the FimH proteins harbouring foreign inserts 
were accepted by the type l fimbrial transport system and 
additionally, whether they were present on the bacterial 
surface in a biologically functional form, the adhesion 
phenotype of recombinant strains expressing the chimeric FimH 
proteins was studied. 



20 



Bacterial hosts which in addition to plasmid pLPA38 (pre-S2 
insert in position 225 in FimH) also contained plasmid 
PPKL115 (fimH) gave, when induced by IPTG, good agglutination 
of guinea-pig erythrocytes indicating the presence of a 
biologically active form of the FimH adhesin on the cells 
(Table 3). The combination of plasmids pLPA37 (pre-S2 in 
25 position 258 in FimH) and pPKLH5 resulted in weaker, but 
detectable, hemagglutination (Table 3) . Furthermore, such 
cells were also shown by electron microscopy to have essen- 
tially normal fimbriation (Fig. 10) . 
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Table 3. Genotype and phenotype of plasmidg (A. B or ir- 
respectively indicate PACYC184. pB R 322 or pUC18 based vector) 
used in this study, position o f inserts and hemagglutination 
titer. Host ce ll: E. coll HB101 



Plasmid 



relevant genotype 



insert 
position 



hemagglu- 
tination 51 





pPKL4 (B) 


all fim genes 




15 




PPKL115 (A) 


fimH 




>600 


10 


pLPA22 (U) 






>600 




pLPA29 (U) 


fimH- Bglll 


258 


>600 




PLPA30 (U) 


fijnff- Bglll 


225 


>600 




pLPA37 (U) 


.fijnff-pre-S2 


258 


>600 




pLPA38 (U) 


fijnff-pre-S2 


225 


>600 


15 


PLPA93 (U) 


fimH- cholera 


225 


>600 




pLPA95 (U) 


fimH- cholera 


258 


>600 




PLPA22 (U) 
+PPKL115 (A) 


fimH* 
fimH 




10 


20 


PLPA29 
+PPKL115 


fimH- Bglll 
fimH 




7 




pLPA30 
+pPKL115 


fimH- Bglll 
fimH 




8 




PLPA37 
+PPKL115 


f imH- pre- S2 
fimH 




210 


25 


PLPA38 
+PPKL115 


fimH- pre -S2 
fimH 




100 




pLPA93 
+pPKL115 


fimH- cholera 
fimH 




11 


30 


PLPA95 
+PPKL115 


fimH- cholera 
fimH 




16 



a) Hemagglutination of guinea-pig erythrocytes indicated in 
seconds before reaction occurred. The average values of 4 
measurements are given. 
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In the cases where a sequence mimicing a cholera epitope had 
been inserted into FimH, viz. pLPA93 (insert in position 225) 
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and pLPA95 (insert in position 258) , respectively, an agglu- 
tination phenotype also resulted when either of these plas- 
mids were complemented by plasmid pPKL115 (fimH) (Table 3) . 
Again, this suggested that in spite of the presence of 
foreign peptide segments the chimeric FimH proteins were 
still able to reach the bacterial surface and maintain its 
adhesive function. In addition to the adherence phenotypes of 
the various clones the presence of engineered FimH adhesins 
on the surface of the cells were monitored by CCD microscopy 
in connection with fluorescent antibody methodology employing 
a FimH- specific monoclonal serum. In all cases, significant 
signals, albeit of varying intensity, were detected when 
compared to a negative control strain that harboured the 
auxiliary plasmid, pPKLH5, alone. 

15 3 t 2.5. Immunological dP^r t ion of fhe pre-SS geoment- nf 

hepatitjg B surface antigen and t he chnlpra toxin epitope in 
chimeric FimH adheains. 

Since there was good evidence that the chimeric FimH proteins 
were present on the surface of the E. coll hosts the ability 
of specific antisera, raised against the pre-S2 part of 
hepatitis B surface antigen or the cholera toxin B chain, 
respectively to recognize the chimeric FimH-pre-S2 and FimH- 
cholera proteins directly on the surface of the recombinant 
bacteria were tested. By immunofluorescence microscopy E. 
coli hosts harbouring either of plasmids pLPA37 or pLPA38 in 
addition to plasmid pPKLH5 were shown to react specifically 
with antisera directed against the inserted heterologous 
sequence, whereas hosts expressing wild- type FimH did not. 
Similar results were obtained with the cholera toxin insert 
in the same positions (plasmids pLPA93/pPKL115 and 
pLPA95/pPKLH5) . Again, the heterologous inserts in the 
chimeric FimH proteins were recognized by insert -specific 
serum on the bacterial surface, whereas the relevant control 
did not react. 
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These findings demonstrate that the foreign epitopes are 
exposed on the surface of extracellularily located chimeric 
FimH proteins and, significantly, in a conformation which 
mimics the natural conformation of the epitope (s) as it 
appears in the native hepatitis B surface antigen or the 
native cholera toxin. 

The results obtained by immunofluorescence microscopy were 
corroborated by immuno- electron microscopy, employing the 
pre-S2 specific monoclonal antibody as primary serum and a 
colloid gold- labelled secondary antiserum. A significant 
amount of gold particles were seen, mostly in connection with 
the fimbrial organelles, on bacterial hosts harbouring chim- 
eric r-ijnff-pre-S2 genes (Fig. 10b and 10c) , whereas only few 
goldparticles were present on the control strain expressing 
wild-type fimbriae (Fig. 10a) . Furthermore, in the latter 
case the gold -particles were not seen to be associated with 
the fimbriae. 



20 



The plasmids pLPA22, pLPA29, pLPA30, pLPA37, pLPA38, pLPA93, 
PLPA95 and pPKLUS in B. coll HB101 were deposited on 26 
January 1994 with DSM, the Deutsche Sammlung von Mikro- 
organismen und Zellkulturen GmbH, (German Collection of 
Microorganisms and Cell Cultures) , Mascheroder Weg IB, D- 
38124 Braunschweig, Germany, under the accession numbers DSM 
8915, DSM 8916, DSM 8917, DSM 8918, DSM 8919, DSM 8920, DSM 
25 8921 and DSM 8923, respectively. 



EXAMPLE 4 

Binding of the MFP c1afl 3 adhesin of E . noli rg H -50 ro qyn - 
thetic peptides 

14 synthetic peptides were synthesized on an ABI automated 
30 peptide synthesizer according to the method of Merrifield 

(Merrifield, R.B. 1963. Solid phase peptide synthesis. I. The 
synthesis of tetrapeptide. J. Am. Chem. Soc. 85:2149). The 
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binding of the E. coli strain CSH-50 to these peptides were 
tested essentially as described in Example 1. The results of 
these binding assays indicated that this MFP class strain 
adhered strongly to one group of peptides whereas the binding 
5 the an other group of peptides was absent or weak. In the 
below listing the one- letter code sequences of the synthetic 
peptides are shown in a + group, i.e. the group of peptides 
to which the tested strain adhered strongly, and a - group of 
peptides to which the binding was weak or absent: 



10 + grOUP Of ptapl-iri^a 



FnSPl: EAQQMVQPQSPVAVSQSKPGCYDNGKHYQI (SEQ ID NO: 13) 

CB-II-G: EEGKRGARGEBGAAGPVGPBGERGARGNR (SEQ ID NO: 14) 

SMK19-32): AIQNIRLRHENKDL (SEQ ID NO: 15) 

SM6(1-11) : RVFPRGTVENPC (SEQ ID NO: 16) 

15 SM12(1-12): DHSDLVAEKQRLC (SEQ ID NO: 17) 

SM12(7-18): AEKQRLEDLGQKC (SEQ ID NO: 18) 

SM5 (175-184) : TVKDKLAKEQC (SEQ ID NO:19) 

SM5 (28-54): KTKNEGLKTENEGLKTENEGLKTENEGC (SEQ IDNO:20) 
- grou p of peptides 



20 SM5 (134-163) : QESKENEKALNELLEKTVKDKIAKEQENKE (SEQ ID NO:21) 
SM5 (117-146) : DLTKELNKTRQELANKQQESKENEKALNEL (SEQ ID NO:22) 
SM5 (14-26): KEALDKYELENHD (SEQ ID NO: 23) 
SM6 (22-31): DVENSMLQAN (SEQ ID NO:24) 

SM5 (55 - 84) : LKTEKSNLERKTAELTS EKKEHEAENDKLKC (SEQ ID NO : 2 5 ) 
25 SM24 (289-303) : HQKLEEQNKTSEASRC (SEQ ID NO:26) 



EXAMPLE 5 

.FimH adhesin of furthsr clinical iaol*t-F>g 

The following clinical isolates of E. coli were tested for 
adhesion class according to the methods described in Example 
1: KB-23, KS-54, U221-3, MJ#9-3, MJ#31-3, MJ#ll-2, MJ#2-2. 
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The results of these experiments are illustrated in Fig. 5. 
As explained above, the isolate KB- 2 3 showed the M L type of 
adhesion, and the isolate U221-3 expressed a M class adhesin 
showing a mannose- resistant type of adhesion and accordingly, 
5 this strain was classified as having a M R class adhesin. 

The amino acid sequences of these clinical isolates are shown 
in Fig. 5 and their nucleotide sequences in Table 5 below. 

Table 5 shows the nucleotide sequences of the fimH genes of 
selected fimH genes disclosed in Example l [CI#3 (SEQ ID 

10 N0:50), CI#4 (SEQ IDN0:44), CT#7 (SEQ IDN0:51), CT#10 (SEQ 
ID NO: 48) and CI#12 (SEQ ID NO: 54)] and as the reference that 
of the E. coli K12 strain PC31 as it was originally disclosed 
by Klemm et al. (ref . 27) as the top sequence designated 
PC3la and the sequence as it was determined recently (PC31b) . 

15 Additionally, the nucleotide sequences of the following 

clinical isolates of E. coli are shown: KS54 (SEQ ID NO:52), 
U221-3 (SEQ ID NO:53), MJ#9-3 (SEQ ID NO:46), MJ#31-3 (SEQ ID 
N0:47), MJ#ll-2 (SEQ ID NO:43), MJ#2-2 (SEQ ID NO:45) and F- 
18 (SEQ ID NO:42) . 
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Table 5. Nucleotide sequences of the above flmH genes dis- 
closed in Example 1, E. coll K12 strain PC31 (PC31a and 
PC31b) and the nucleotide sequences of KS54, U221-3, MJ#9-3, 
MJ#31-3, MJ#ll-2, MJ#2-2 and F-18. 



1094 



PC31* 

F-18 
MJ11-2 
CZ 4 
MJ22 
MJ9-3 
MJ31-3 
CI 10 
PC3lb 
CI 3 
CI 7 
KS54 
U221-3 
CI 12 

1148 

PC31& 
F-18 
MJ11-2 
CI 4 
MJ22 

pc3u> zzz zzz zzz zzz z 

1202 

JS^ fff !ff ff f ™ !!I TAT GT * CTT CCC GTC GTG AAT GTG GGG CAA AAC 



-If ^ ff? *!! *f f flf ffT CT * CT G Axe GGG T GG TCC CTA AAT 



1148 

~ Iff If^ Ilf ff f If! ^ « fff fff ACC GCT ATC CCT ATT GGC GGT 



F-18 

MJ11-2 

CI 4 

MJ22 

MJ9-3 T 



T -C- 

T -C- 

T -C- 

X -C- 



-C- 



PC31b 



-C- 



U221-3 

CI 12 " * C ~ 



— T -C- 
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Table 5, continued 



1256 

F-l^ -If f If f *I "I Iff *f f ATC " T TGC ^ ^ GAT ™ CCG GAA ACC 
MJ11-2 A ZZZ ZZZ I~ "I ~Z ~I ™ £ 

mj9-3 a 1 Z~ III ~3 ~I ZZZ ZZZ Z c ~~ A 



KSS4 
U221-3 
CI 12 



13X2 

^f ^ f^f I" flf ?± CGA «J TCG GCT TAT GGC GGC GTG TTA TCT 



PC31* 
F-18 
MJ11-2 
CI 4 
MJ22 
MJ9-3 

CI 3 T III ZZZ ZZZ ZZZ 

CI 12 



PC31* 

F-18 



1366 

SI IH Iff fff *ff *** ™ AGT GGC AG * AGC TAT CCA TTT CCT ACC ACC 

MJ11-2 -G ZZZ Z * T ~ C T 

CI 4 -G A f T "~ C T 

MJ22 -G . "~ ~" "I ^ZZ 1 T 

ci io ZZZ ZZZ ZZZ ZZZ ZZZ G " T — 

U221-3 — > Z ZZZ ZZZ ZZZ- ZZZ ZZZ ZZZ ~ZZ G " T — 
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PC31* 
F-18 



1420 

AGC GAA ACG CCG CGC GTT GTT TAT AAT rrr ~ . 

G „- CG GAT AAG CCG 7GG CCG GTG 



MJ22 
MJ9-3 
MJ31-3 
CI 10 

PC31to 

KS54 T "° 

0221-3 
CZ 12 



PC31* 



1474 



F-18 2 ff fH ™ ™ « « !f *f! « « « « O" «* *AX GCTGGC 

1S28 

FC31» TCA TTA ATT GCC GTG CTT ATT TTG era r-*i- »~ 
F-18 . .„ CC ^ ^ TAT AGC GAT GAT 



CI 12 
PC31* 
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PC3la 

F-18 

MJ11-2 

CI 4 

MJ22 
MJ9-3 
MJ31-3 
CI 10 
PC3Xb 
CI 3 
CI 7 
KS54 
U221-3 
CI 12 



XS82 




Z ~ c : ~ 1:1 

»C31« OOC TGC GAI GTT TCT OCT CGT 



F-ie 

AJll-2 t 



~ p ff! ffl « « *f f «* ACT CTG CCG GAC TAC CGT GGT 



CI 4 j 

mj22^ T c ZZZ ZZZ ZZZ Z T ™ • C 



MJ9-3 
MJ31-3 
CI 10 
PC3U) 

CI 3 

CI 7 

ICS54 

U221-3 

CI 12 



1690 T ° 

^18 im !^ !!f ffj A " OX CTT ACC GTT TAT TGT GCG AAA AGC CAA «r „ 
Mj X1 - 2 ~ £ _ *^ CAA *AC CTG GGG TAT 



CI 

MJ22 
MJ9-3 
MJ31-3 
CI 10 
PC31b 

CI 3 

CI 7 

KS54 

U221-3 

CI 12 
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Table 5, continued 



1744 

-~ -~ ™ !ff ^ f!f ?*I fff fff ICG ATT TTC ACC AAT ACC GCG 



1798 



F "s If ZH I~ f 2 f!J « « « fi* « ™ ^ !!f *"= 051 *» « 



HJ11-2 
CI 4 
MJ22 
MJ9-3 
MJ31-3 
CI 10 
PC31b 



CI 7 
KS54 
U221-3 
CI 12 



1852 

-H f » ff f ™ ™ *f f ff* Iff ™ ff A «» GT * GGG ACT TCG GCG GTG ACT 



1852 

PC31a 

F-18 
MJ11-2 
CI 4 

MJ22 A 
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Table 5, continued 



X906 




FC31a 

F-18 
KJ11-2 
CI 4 
MJ22 
MJ9-3 
MJ31-3 
CI 10 
PC3U) 
CI 3 
CI 7 
KS54 
U221-3 
CI 12 



I960 

GTG CAA TCG ATT ATT GGC GTG ACT TTT GTT TAT CAA 
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EXAMPLE 6 



Enrichment selection of strains havi n g mutated FimH arihpsin. 
conferring a ltered adhesion ability 

One mechanism whereby new binding activities of bacterial 
adhesins may arise is by random, naturally occurring mutage- 
nesis. In nature, a variety of factors would enrich for 
strains that possessed adhesive capacities conferring a 
selective advantage. In the present example an in vitro 
procedure was used to select for potential mutants with 
altered adhesive capacity. As a target substratum bovine k- 
casein was selected. 

K r casein is the glycosylated isoform of bovine casein con- 
sists of a single polypeptide chain containing 169 amino acid 
residues the sequence of which has been determined (ref. 68). 
Bovine *- casein does not contain N-glycosidic linkages, but 
up to six 0- linked oligosaccharides are present in the C- 
terminal region of the molecule (refs. 68, 69). The sacchari- 
de moieties are heterologous and also vary as a function of 
time after parturition. Of significance for the present study 
is the fact that D-mannose is not present in the bovine k- 
casein. Only di- to hexasaccharides containing galactose, N- 
acetyl-galactosamine, N- acetyl -glucosamine, fucose and sialic 
acid have been described (ref. 68). Glycoproteins having such 
saccharide compositions would not be expected to serve as a 
receptor for the classic type of the FimH adhesin such as is 
found in E. coll strain PC31. 

Adhesion tests were performed to verify the inability of 
recombinant strains carrying the fimH gene from B. coli 
strain PC31 to adhere to immobilized k- casein. The E. coli 
strain used, KB1001 is HB101 containing plasmids pPKLUS and 
PLPA22 (ref. 70). The adhesion assay was performed using 
microtiter plates coated with 30 fig/ml K-casein in 0.1 M 
sodium bicarbonate (pH 9.6) for 30 minutes, followed by 
blocking any remaining binding sites with a subsequent in- 
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cubation with 0.1% bovine serum albumin (BSA) in PBS. A 
quantitative adhesin assay was performed as described in more 
detail elsewhere (ref. 71). Briefly, bacteria were diluted to 
equivalent concentrations (5 x 10 7 cells/100/zl) in PBS 
5 containing 0.1% BSA, added to coated microtiter wells for 30 
minutes at 37°C. After washing the wells thoroughly to remove 
unbound bacteria, BHI broth was added and the bacteria were 
allowed to grow at 37°C on a rotating platform (150 rpm) 
until the optical density could be measured (2-2.5 hours) . 
10 Comparisons can be made of optical densities obtained in the 
test wells to those obtained in standard curves developed 
from the plating of known numbers of bacteria under similar 
conditions, allowing extrapolation to absolute numbers of 
bound bacteria (ref. 70) . 

15 The KB1001 strain comprising the fimH gene from PC31 bound to 
immobilized mannan in significant numbers, but there was 
substantially no measurable adhesion to immobilized k -casein. 
To select for possible mutant cells having acquired the 
ability to bind to x-casein, cells of KB1001 were allowed to 
interact with K-casein immobilized on microtiter wells. After 
thorough washing to remove non-adhering bacterial cells, 
cells adhering to the wells were collected and grown over- 
night in BHI broth. These "enriched- bacterial cultures were 
again allowed to interact with immobilized k- casein, the 
plates were washed and adhering cells collected in nutrient 
broth. This enrichment cycle was repeated up to ten times. 
Bacterial cells obtained from the last of these cycles ("en- 
riched" strains) adhered to k- casein in significantly in- 
creased numbers in comparison to the parent ( "non -enriched " ) 
30 strain (Table 6.1). Individual colonies of "enriched" KB1001 
were isolated and four tested for ability to adhere to k- 
casein. Three enriched cultures (clones) bound to K-casein 
significantly better than did the non- enriched parent strain. 



20 



25 
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Table 6.1. Adhesion to casein of n on-enriched and enrichPri g 
coll strain KBlOOl . 



Strain 



bacteria binding to 



if -casein' 



a) 



Non- enriched KBlOOl 
(pPKL115 + pLPA22) 



0.043 + 0.018 



0 



Enriched KBlOOl 
(pPKL115 + pLPA22) 



0.249 + 0.004 



a) numbers represent optical density of bacterial growth + 
S.D. with background O.D. substracted. N = 3. 

To determine whether the new adhesive activity was due to 
plasmid- related changes and not simply to host cell -related 
changes, plasmid preparations of pLPA22 were made from en- 
riched and from non- enriched strains and used to transform E. 
coli HB101 containing the auxiliary plasmid pPKLllS. Randomly 
selected transf ormants resistant to ampicillin and chloram- 
phenicol were tested for adhesion to k- casein, and several of 
the transformants harbouring plasmids from enriched cultures 
adhered in significantly increased numbers relative to plas- 
mid -containing cells of the non-enriched parent strain (Table 
6.2). 



Table 6.2. Adhesion to casein of mini , (ppki.h m transfn^oH 
with plasmids from enrich^ or nnn-Pnri ch ed nt-.raln KBinm 



Plasmid derived from: 



bacteria binding to 



k- casein 



a) 



Non -enriched KBlOOl 



5 ± 0.1 x 10 : 



Enriched KBlOOl 



50 + 1.5 x 10 : 



WO 95/20657 PCT/DK95/00042 

a) numbers represent mean number of bacteria per well + S.D. 
N = 3. 

The above results demonstrate that random or spontaneous 
mutations in genes coding for a bacterial adhesin that confer 
5 binding to a new substratum (i.e. a receptor moiety to which 
the parent strain does not bind) , can be selected for by 
appropriate in vitro procedures. 
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are not for all designated States) 



E. SEPARATE FURNISHING OF ^ INDICATIONS Qa^kh^lfnct.ppT.cabk) 



'Accession 



For receiving Office use only 



This sheet was received with the international application 



Authorized officer 




For International Bureau use only 



is sheet was received by the International Bureau < 



Authorized officer 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISMS 

(PCT Rule 12bis) 

Additional sheet 

In addition to the microorganism indicated on page 53 of the 
5 description, the following microorganisms have been deposited 
with r 

DSM-Deutsche Sammlung von Mikroorganismen und 
Cellkulturen GmbH 

Mascheroder Weg lb, D-38124 Braunschweig, Germany 
10 on the dates and under the accession numbers as stated below: 

Accession Date of Description Description 

number deposit Page No. Line No. 



15 



20 



25 



DSM 


8915 


26 


January 


1994 


66 


24 


DSM 


8916 


26 


January- 


1994 


66 


24 


DSM 


8917 


26 


January 


1994 


66 


24 


DSM 


8918 


26 


January 


1994 


66 


24 


DSM 


8919 


26 


January 


1994 


66 


24 


DSM 


8920 


26 


J anuary 


1994 


66 


24 


DSM 


8921 


26 


January 


1994 


66 


25 


DSM 


8923 


26 


January 


1994 


66 


25 



For all of the above -identified deposited microorganisms, the 
followxng additional indications apply: 



As regards the respective Patent Offices of the respective 
designated states, the applicants request that a sample of 
the deposited microorganisms stated above only be made 
available to an expert nominated by the requester until the 
date on which the patent is granted or the date on which the 
30 application has been refused or withdrawn or is deemed to be 
withdrawn. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: GX BioSystems A/S 

(B) STREET: Mothsvej 70 

(C) CITY: Holte 

(D) COUNTRY : Denmark 

(E) POSTAL CODE (ZIP) : 2840 

(ii) TITLE OF INVENTION: Receptor specific bacterial adhesins and 
their use 

(iii) NUMBER OF SEQUENCES: 55 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC conpatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 go 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 
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Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu lie Ala Val Leu lie Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn lie Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
2€0 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
2 90 295 300 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val Ala Val Ser Gin 



1 



5 



10 



15 



Ser 



Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr Gin He 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
GGGGGGTGCA CACCTACAGC TGAACCCGG 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGGGGTGCAC TCAGGGAACC ATTCAGGCA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGTGCGCAT TATTGATAAA CAAAAGTCAC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGGGCATGCT TATTGATAAA CAAAAGTCAC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTCGACTTAA TTAATTAAGT CGAC 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAGATCTG 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GAAGATCTTC 

10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
GATCCAGCTT TTTTCTGACT ATCGATATGC TGACTACCCG GAACTTCAAC A 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGAGATCTAA TTCCACAACC TT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGAGATCTGT TCAGCGCAGG GT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



22 



22 



Glu Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val Ala Val Ser Gin 
1 5 10 15 
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Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr Gin lie 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Glu Gly Lys Arg Gly Ala Arg Gly Glu Asx Gly Ala Ala Gly Pro 
15 10 15 

Val Gly Pro Asx Gly Glu Arg Gly Ala Arg Gly Asn Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala lie Gin Asn lie Arg Leu Arg His Glu Asn Lys Asp Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Arg Val Phe Pro Arg Gly Thr Val Glu Asn Pro Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp His Ser Asp Leu Val Ala Glu Lys Gin Arg Leu Cys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Ala Glu Lys Gin Arg Leu Glu Asp Leu Gly Gin Lys Cys 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Val Lys Asp Lys Leu Ala Lys Glu Gin Cys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 



1 



5 



10 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Thr Lys Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr 
1 5 10 15 

Glu Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Gin Glu Ser Lys Glu Asn Glu Lys Ala Leu Asn Glu Leu Leu Glu Lys 
1 5 10 15 

Thr Val Lys Asp Lys He Ala Lys Glu Gin Glu Asn Lys Glu 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Asp Leu Thr Lys Glu Leu Asn Lys 
1 5 

Gin Gin Glu Ser Lys Glu Asn Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 



Thr Arg Gin Glu Leu Ala Asn Lys 
10 15 

Lys Ala Leu Asn Glu Leu 
25 30 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Asp Val Glu Asn Ser Met Leu Gin Ala Asn 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Leu Lys Thr Glu Lys Ser Asn Leu Glu Arg Lys Thr Ala Glu Leu Thr 
1 5 10 15 

Ser Glu Lys Lys Glu His Glu Ala Glu Asn Asp Lys Leu Lys Cys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

His Gin Lys Leu Glu Glu Gin Asn Lys Thr Ser Glu Ala Ser Arg Cys 
15 10 is 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 eo 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Glv 
"0 135 i 4 o 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 i6o 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 ivo i7 5 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 



Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val 



Tyr Cys 
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195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 go 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 7 0 75 ao 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
35 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 
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Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Aen Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Cys Asp Val Thr Val Thr 
!80 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
"5 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
27 5 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
2^0 295 300 

(2) INFORMATION FOR SBQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : unknown 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Aen Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 
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Cys His Asn Asp Tyr Pro Glu Thr lie Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Glu Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 igo 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr Asp Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



290 



295 



300 
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Met Lys Arg Val lie Asn Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 S 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala lie 
20 25 30 

Pro lie Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin His Leu Val Val Asp Leu Ser Thr Gin lie Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr lie Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Leu 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu lie Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 31: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala lie 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Lys Ala Gly Ser Leu He Ala 
130 135 140 

Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser Asp Asp Phe Gin 
145 150 155 160 

Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val Val Pro Thr Gly 
165 170 175 

Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr Leu Pro Asp Tyr 
180 185 190 

Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys Ala Lys Ser Gin 
195 200 205 

Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp Ala Gly Asn Ser 
210 215 220 

He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin Gly Val Gly Val 
225 230 235 240 

Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn Asn Thr Val Ser 
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245 250 255 

Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly Leu Thr Ala Asn 
260 265 270 

Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn Val Gin Ser He 
275 280 285 

He Gly Val Thr Phe Val Tyr Gin 
290 295 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 is 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Val 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Arg Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 
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Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
. 245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 235 300 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Lys Arg Val lie Asn Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
15 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 2S 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 
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Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 

115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Val lie Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
"5 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 34; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 io is 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 
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Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
1*0 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
I 95 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 go 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
iOO 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
H5 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
"0 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
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(2) INFORMATION FOR SEQ ID NO: 36: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 io is 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Arg 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 i 6 o 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 
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Ala Gly Asn Ser lie Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr lie He Pro Ala Asn 
24 5 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
27 5 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 3 0 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 go 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 ™ 75 so 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 9 o 95 

Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 iso 
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Asp Asp Phe Gin Phe Val Trp Asn lie Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro lie Pro Leu Thr Val Tyr Cys 
155 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Lys Arg Val He Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 io 



15 



Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 so 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Asn Phe Ser Gly Thr Val 
85 90 95 
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Lys Tyr Ser Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu lie Ala Val Leu lie Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Val Ser Ala His Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO; 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 
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Pro lie Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin lie Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr lie Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Gly Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 110 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
115 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala lie Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
275 280 285 

Val Gin Ser He He Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Lys Arg Val lie Thr Leu Phe Ala Val Leu Leu Met Gly Trp Ser 
1 5 10 15 

Val Asn Ala Trp Ser Phe Ala Cys Lys Thr Ala Asn Gly Thr Ala He 
20 25 30 

Pro He Gly Gly Gly Ser Ala Asn Val Tyr Val Asn Leu Ala Pro Ala 
35 40 45 

Val Asn Val Gly Gin Asn Leu Val Val Asp Leu Ser Thr Gin He Phe 
50 55 60 

Cys His Asn Asp Tyr Pro Glu Thr He Thr Asp Tyr Val Thr Leu Gin 
65 70 75 80 

Arg Gly Ser Ala Tyr Gly Asp Val Leu Ser Ser Phe Ser Gly Thr Val 
85 90 95 

Lys Tyr Asn Gly Ser Ser Tyr Pro Phe Pro Thr Thr Ser Glu Thr Pro 
100 105 no 

Arg Val Val Tyr Asn Ser Arg Thr Asp Lys Pro Trp Pro Val Ala Leu 
US 120 125 

Tyr Leu Thr Pro Val Ser Ser Ala Gly Gly Val Ala He Lys Ala Gly 
130 135 140 

Ser Leu He Ala Val Leu He Leu Arg Gin Thr Asn Asn Tyr Asn Ser 
145 150 155 160 

Asp Asp Phe Gin Phe Val Trp Asn He Tyr Ala Asn Asn Asp Val Val 
165 170 175 

Val Pro Thr Gly Gly Cys Asp Ala Ser Ala Arg Asp Val Thr Val Thr 
180 185 190 

Leu Pro Asp Tyr Pro Gly Ser Val Pro He Pro Leu Thr Val Tyr Cys 
195 200 205 

Ala Lys Ser Gin Asn Leu Gly Tyr Tyr Leu Ser Gly Thr His Ala Asp 
210 215 220 

Ala Gly Asn Ser He Phe Thr Asn Thr Ala Ser Phe Ser Pro Ala Gin 
225 230 235 240 

Gly Val Gly Val Gin Leu Thr Arg Asn Gly Thr He He Pro Ala Asn 
245 250 255 

Asn Thr Val Ser Leu Gly Ala Val Gly Thr Ser Ala Val Ser Leu Gly 
260 265 270 

Leu Thr Ala Asn Tyr Ala Arg Thr Gly Gly Gin Val Thr Ala Gly Asn 
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275 280 285 

Val Gin Ser lie lie Gly Val Thr Phe Val Tyr Gin 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA 


ACCTTGCGCC 


CGTCGTGAAT 


GTGGGGCAAA ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG 


GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT 


AA'rrrrrccG 


GGACCGTAAA ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA 


ACGCCGCGCG 


TTGTTTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


CGGGGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCTACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCG 


TGGTTCAGTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTGAGT 


CTGGGATTAA 


CGGCAAATTA 


TGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 



( 2 ) INFORMATION FOR SEQ ID NO : 4 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTGGCGGTGG CAGCGCCAAT 120 

GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATGT CACACTGCAA 240 

CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGTTTTTCCG GGACCGTAAA ATATAATGGC 300 

AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCGGTGA GCAGTGCGGG GGGAGTGGCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 

GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 540 

GGCTGCGATG TTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 

CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCCGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 84 0 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTGGCGGTGG CAGCGCCAAT 120 

GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATGT CACACTGCAA 240 
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CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGTTTTTCCG GGACCGTAAA ATATAATGGC 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCGGTGA GCAGTGCGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG CTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTATCCGGC 
ACACATGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCCGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTGAGT CTGGGATTAA CGGCAAATTA TGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTACCCG GAAACCATTA CAGACTATGT CACACTGCAA 
CGAGGTTCGG CTTATGGCGG CGTGTTATCT AGTTTTTCCG AGACCGTAAA ATATAATGGC 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG TTTCTGCTCG TGATGTCACC GTTACTTTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 
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ACAGACGCAG ATGCGGGCAA CTCGATTITC ACCAATACCG CGTCGTTTTC ACCTGCACAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCAATCCCTA TTGGCGGTGG CAGCGCCAAT 120 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 
ACGCAAATCT TTTGCCATAA CGATTACCCA GAAACCATTA CAGACTATGT CACACTGCAA 240 
CGAGGTTCGG CTTATGGCGA CGTGTTATCT AGTTTTTCCG GGACCGTAAA ATATAATGGC 300 
AGTAGCTATC CTTTCCCTAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCGGTGA GCAGTGCGGG GGGAGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGTGATG TCTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTATCCGGC 
ACAGACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCCGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 
GGAGGGCAGG TGACCGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 



420 
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(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 900 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATTCCTA TTGGCGGTGG CAGCGCTAAT 
GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 
CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 
AGTAGCTATC CATTCCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TGGGGTGGCG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT G CCTACTGG C 
GGCTGCGATG TTTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCAATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 
ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCAGCGCAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATTCCTA TTGGCGGTGG CAGCGCTAAT 
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GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA AC CTGGTCGT AGATCTTTCG 180 
ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 240 
CGAGG CTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 300 
AGTAGCTATC CATTTCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TGGGGTGGCG 420 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 540 
GGCTGCGATG TTTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 
CCAATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 
ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCAGCGCAG 720 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 120 

GTTTATGTAA ACCTTGCGCC CGCCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 240 

CGAGG CTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 300 

AGTAGCTATC CATTTCCTAC CACCAGCGAA ACGCCGCGCG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TAAAGCTGGC 420 

TCATTAATTG CCGTGCTTAT TTTGCGACAG ACCAACAACT ATAACAGCGA TGATTTCCAG 480 

TTTGTGTGGA ATATTTACGC CAATAATGAT GTGGTGGTGC CTACTGGCGG CTGCGATGTT 540 
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TCTGCTCGGG ATGTCACCGT TACTCTGCCG GACTACCCTG GTTCAGTGCC AATTCCTCTT 600 

AC CGTTTATT GTGCGAAAAG CCAAAACCTG GGGTATTACC TCTCCGGCAC ACACGCAGAT 660 

GCGGGCAACT CGATTTTCAC CAATACCGCG TCGTTTTCAC CTGCACAGGG CGTCGGCGTA 720 

CAGTTGACGC GCAACGGTAC GATTATTCCA GCGAATAACA CGGTATCGTT AGGAGCAGTA 780 

GGGACTTCGG CGGTGAGTTT GGGATTAACG GCAAATTATG CACGTACCGG AGGGCAGGTG 84 0 

ACTGCAGGGA ATGTGCAATC GATTATTGGC GTGACTTTTG TTTATCAA 888 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA 


ACCTTGCGCC 


CGTCGTGAAT 


GTGGGGCAAA 


ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG 


GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT 


AArrriTCCG 


GGACCGTAAA 


ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA ACGCCGCGCG 


TTGTTTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGC CGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


CGGGGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCTACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCC 


TGGTTCAGTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA 


AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGAATATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTGAGT 


CTGGGATTAA 


CGGCAAATTA 


TGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 



(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
ATGAAACGAG TTATTAACCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
TCATTCGCCT GTAAAACCGC CAATGGCACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 
GTTTATGTAA ACCTTGCGCC CGCCGTGAAT GTGGGGCAAA ACCTGGTCGT GGATCTTTCG 
ACGCAAATCT TTTGCCATAA CGATTACCCG GAAACCATTA CAGATTATGT CACACTGCAA 
CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 
AGTAGCTATC CATTTCCGAC CACCAGTGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG CGGGGTGGTG 
ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 
GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCCACTGGC 
GGCTGCGATG TTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 
CCGATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 
ACACACGCAG ATGCGGGCAA CTCGATTTTC AC CAATACCG CGTCGTTTTC ACCTGCACAG 
GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 
TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 
GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG CCGTGACTTT TGTTTATCAA 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATGAAACGAG TTATTAACCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 
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TCATTCGCCT GTAAAACCGC CAATGGCACC GCTATCCCTA TTGGCGGTGG CAGCGCCAAT 120 
GTTTATGTAA ACCTTGCGCC CGCCGTGAAT GTGGGGCAAC ACCTGGTCGT AGATCTTTCG 180 
ACGCAAATCT TTTGCCATAA CGATTACCCG GAAACCATTA CAGACTATGT CACACTGCAA 240 
CGAGGTTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 300 
AGTAG CTATC CATTTCCTAC CACCAGCGAA ACGCTGCGGG TTGTTTATAA TTCGAGAACG 360 
GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG CGGGGTGGCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 

GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 540 

GGCTGCGATG TTTCTGCTCG TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 

CCAATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CCTCGTTTTC ACCAGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ATGAAACGAG TTATTACCCT GTTTGCTGTA CTGCTGATGG GCTGGTCGGT AAATGCCTGG 60 

TCATTCGCCT GTAAAACCGC CAATGGTACC GCTATCCCTA TTGGCGGTGG CAGCGCTAAT 120 

GTTTATGTAA ACCTTGCGCC TGCCGTGAAT GTGGGGCAAA ACCTGGTCGT AGATCTTTCG 180 

ACGCAAATCT TTTGCCATAA CGATTATCCG GAAACCATTA CAGACTATGT CACACTGCAA 24 0 

CGAGGCTCGG CTTATGGCGG CGTGTTATCT AATTTTTCCG GGACCGTAAA ATATAGTGGC 300 

AGTAGCTATC CATTTCCGAC TACCAGCGAA ACGCCGCGGG TTGTTTATAA TTCGAGAACG 360 

GATAAGCCGT GGCCGGTGGC GCTTTATTTG ACGCCTGTGA GCAGTGCGGG TGGGGTGGCG 420 

ATTAAAGCTG GCTCATTAAT TGCCGTGCTT ATTTTGCGAC AGACCAACAA CTATAACAGC 480 
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GATGATTTCC AGTTTGTGTG GAATATTTAC GCCAATAATG ATGTGGTGGT GCCTACTGGC 540 

GGCTGCGATG TTTCTGCTCA TGATGTCACC GTTACTCTGC CGGACTACCC TGGTTCAGTG 600 

CCAATTCCTC TTACCGTTTA TTGTGCGAAA AGCCAAAACC TGGGGTATTA CCTCTCCGGC 660 

ACACACGCAG ATGCGGGCAA CTCGATTTTC ACCAATACCG CGTCGTTTTC ACCAGCGCAG 720 

GGCGTCGGCG TACAGTTGAC GCGCAACGGT ACGATTATTC CAGCGAATAA CACGGTATCG 780 

TTAGGAGCAG TAGGGACTTC GGCGGTAAGT CTGGGATTAA CGGCAAATTA CGCACGTACC 840 

GGAGGGCAGG TGACTGCAGG GAATGTGCAA TCGATTATTG GCGTGACTTT TGTTTATCAA 900 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA 


CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC 


GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA ACCTTGCGCC 


CGCCGTGAAT 


GTGGGGCAAA 


ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTATCCG 


GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGCTCGG 


CTTATGGCGG 


CGTGTTATCT 


AATITTTCCG 


GGACCGTAAA ATATAGTGGC 


300 


AGTAGCTATC 


CATTTCCTAC 


CACCAGCGAA ACGCCGCGCG 


TTGTTTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG 


ACGCCTGTGA 


GCAGTGCGGG 


CGGGGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTGCTT 


ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC 


GCCAATAATG 


ATGTGGTGGT 


GCCTACTGGC 


540 


GGCTGCGATG 


TTTCTGCTCG 


TGATGTCACC 


GTTACTCTGC 


CGGACTACCC 


AGGTTCAGTG 


600 


CCAATTCCTC 


TTACCGTTTA 


TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC 


ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT 


ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTGAGT 


CTGGGATTAA 


CGGCAAATTA 


TGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA 


TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



ATGAAACGAG 


TTATTACCCT 


GTTTGCTGTA CTGCTGATGG 


GCTGGTCGGT 


AAATGCCTGG 


60 


TCATTCGCCT 


GTAAAACCGC 


CAATGGTACC GCTATCCCTA 


TTGGCGGTGG 


CAGCGCCAAT 


120 


GTTTATGTAA 


ACCTTGCGCC 


TGCCGTGAAT GTGGGGCAAA 


ACCTGGTCGT 


GGATCTTTCG 


180 


ACGCAAATCT 


TTTGCCATAA 


CGATTACCCG GAAACCATTA 


CAGACTATGT 


CACACTGCAA 


240 


CGAGGTTCGG 


CTTATGGCGG 


CGTGTTATCT AGTTTTTCCG 


GGACCGTAAA 


ATATAATGGC 


300 


AGTAGCTATC 


CTTTCCCTAC 


TACCAGCGAA ACGCCGCGCG 


TTGTTTATAA 


TTCGAGAACG 


360 


GATAAGCCGT 


GGCCGGTGGC 


GCTTTATTTG ACGCCTGTGA 


GCAGTGCGGG 


GGGAGTGGCG 


420 


ATTAAAGCTG 


GCTCATTAAT 


TGCCGTG CTT ATTTTGCGAC 


AGACCAACAA 


CTATAACAGC 


480 


GATGATTTCC 


AGTTTGTGTG 


GAATATTTAC GCCAATAATG 


ATGTGGTGGT 


GCCCACTGGC 


540 


GGCTGTGATG 


TTTCTGCTTG 


TGATGTCACC GTTACTTTGC 


CGGACTACCC 


TGGTTCAGTG 


600 


CCGATTCCTC 


TTACCGTTTA TTGTGCGAAA AGCCAAAACC 


TGGGGTATTA 


CCTCTCCGGC 


660 


ACACACGCAG 


ATGCGGGCAA 


CTCGATTTTC ACCAATACCG 


CGTCGTTTTC 


ACCTGCACAG 


720 


GGCGTCGGCG 


TACAGTTGAC 


GCGCAACGGT ACGATTATTC 


CAGCGAATAA 


CACGGTATCG 


780 


TTAGGAGCAG 


TAGGGACTTC 


GGCGGTAAGT CTGGGATTAA 


CGGCAAATTA 


CGCACGTACC 


840 


GGAGGGCAGG 


TGACTGCAGG 


GAATGTGCAA TCGATTATTG 


GCGTGACTTT 


TGTTTATCAA 


900 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GATCTGTTGA AGTTCCGGGT AGTCAGCATA TCGATAGTCA GAAAAAAGCT G 51 
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CLAIMS 



10 



1. A method of targeting a bacterial adhesin to a specific 
location, comprising (i) identifying in said location adhes- 
in- interacting receptor moiety which is recognizable by 
bacterial adhesins, (ii) isolating a bacterial cell that 
grows in said location and expresses an adhesin recognizing 
and interacting with said receptor moiety, and administering 
to the location the bacterial cell or the adhesin under 
conditions where the adhesin and the receptor moiety are 
brought into interacting contact whereby the adhesin is 
associated with the receptor moiety. 

2 . A method according to claim 1 wherein the receptor moiety 
is selected from the group consisting of a glycolipid, a 
glycoprotein, a protein, a polypeptide, a saccharide moiety 

15 and a peptide. 

3. A method according to claim l wherein the isolated bacte- 
rial cell expresses an adhesin having modified receptor 
moiety -binding properties relative to an adhesin natively 
expressed by the cell, the isolation of the cell comprising 
identifying in a parent bacterial cell, DNA sequence (s) 
coding for the binding domain (s) of said natively expressed 
adhesin and substituting at least one codon herein, whereby a 
modified adhesin molecule is expressed that is different in 
at least one amino acid from the adhesin expressed natively, 
and selecting a the bacterial cell expressing the modified ' 
adhesin having an altered adhesion phenotype relative to the 
natively expressed bacterial adhesin. 

4. A method according to claim l wherein the bacterial cell 
expressing an adhesin that recognizes and binds to the 
receptor moiety is a recombinant bacterial cell derived from 
a parent bacterial cell that does not produce an adhesin 
binding to said receptor, by inserting into the parent cell a 
DNA sequence coding for an adhesin binding to the receptor 



20 



25 



30 
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moiety, and selecting a bacterial cell expressing the DNA 
sequence . 

5. A method according to claim 1 wherein a non-adhesin com- 
pound is associated with the adhesin, whereby said compound 

5 is targeted with the adhesin to the location comprising the 
receptor moiety recognizable by the adhesin. 

6. A method according to claim 5 wherein the compound is 
covalently bound to the adhesin. 

7. A method according to claim 6 wherein the compound is part 
10 of a fusion protein comprising the adhesin, the compound 

being selected from the group consisting of an enzyme, an 
antibody, an epitope and a toxin. 

8. A method according to claim 5 wherein the compound is one 
associated with the adhesin by a non-covalent binding. 

15 9. A method according to claim 8 wherein the compound is 
selected from the group consisting of a pharmacologically 
active, a diagnostically active and an imaging compound. 

10. A method according to claim 1 wherein the specific loca- 
tion is a human or animal surface. 

20 11. A method according to claim 1 wherein the specific loca- 
tion is a plant surface. 

12. A method according to claim 1 wherein the bacterial cell 
expresses a recombinant bacterial adhesin variant derived 
from a naturally occurring parent adhesin, said recombinant 
25 bacterial adhesin variant having altered binding properties 
relative to the naturally occurring adhesin from which it is 
derived, the altered binding properties including binding to 
at least one receptor moiety to which the parent adhesin does 
not bind. 
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13. A method according to claim 12 wherein the adhesin vari- 
ant is derived from a naturally occurring adhesin isolated 
from a cell structure selected from the group consisting of a 
capsule, a lipopolysaccharide layer, an outer membrane pro- 
5 tein, a flagellum, a pilus, a fimbria, a non-fimbrial adhesin 
(NFA) and an af imbrial adhesin (AFA) . 



14* A method according to claim 12 or 13 wherein the adhesin 
variant is a protein having an amino acid sequence differing 
in at least one amino acid residue from its parent protein 
10 adhesin. 

15. A method according to claim 14 wherein the adhesin vari- 
ant is a FimH adhesin having an amino acid sequence which 
differs from the E. coli PC31 FimH adhesin as defined in 
Table 1 herein in at least one amino acid. 

15 16. A method according to claim 15 wherein the FimH adhesin 
is one binding to a receptor selected from the group consist- 
ing of a domain where mannosyl residues are not terminal and 
a domain devoid of saccharide. 

17. A method according to claim 15 wherein the adhesin vari- 
20 ant is a chimeric adhesin comprising amino acid sequences 

from different FimH adhesins. 

18. A method according to claim 15 wherein the FimH adhesin 
has an amino acid sequence which is selected from the group 
consisting of sequences appearing in Fig. 5 herein with 

25 designations CI#12, CI#4, CI#7 or CSH-50. 

19. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma f ibronectin (Fn) , periodate treated Fn and the syn- 
thetic peptide FnSpl comprising the first 30 amino acids of 

30 Fn, only binds to Mn (M class) . 
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20. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma fibronectin (Fn) , periodate treated Fn and the syn- 
thetic peptide FnSpl comprising the first 3 0 amino acids of 

5 Fn, binds to Mn and Fn (MF class) . 

21. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to yeast mannan (Mn) , human 
plasma fibronectin (Fn) , periodate treated Fn and the syn- 
thetic peptide FnSpl comprising the first 30 amino acid 

10 residues of Fn, binds to all of these (MFP class) . 

22. A method according to claim 15 wherein the adhesin is one 
which, when tested for binding to five Fn- fragments obtained 
by thermolysin treatment, only binds to the 40-kDa gelatin- 
binding fragment. 

15 23 . A method according to claim 22 wherein the adhesin is one 
which, when tested for binding to five Fn- fragments obtained 
by thermolysin treatment, binds to all five fragments. 

24. A method according to claim 15 wherein the adhesin is at 
least 90% homologous to the PC31 FimH adhesin. 

20 25. A method according to claim 15 wherein the adhesin is a 
chimeric adhesin comprising amino acid sequences from differ- 
ent FimH adhesins. 

26. A method according to claim 15 comprising an amino acid 
sequence which differs from the E. coli PC31 FimH adhesin by 

25 at least one amino acid occurring between residues 27 and 119 
of the mature FimH sequence. 

27. A method according to claim 15 wherein the adhesin binds 
to a receptor moiety selected from the group consisting of a 
human receptor moiety, an animal receptor moiety, a plant 

30 receptor moiety and an inanimate, non-biological receptor 
moiety. 
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28. A method according to claim 1 wherein the bacterial cell 
being targeted is a cell comprising a gene coding for a gene 
product which, when expressed has a killing or cell function- 
limiting effect in said cell, the expression of said gene 
5 coding for the cell killing or cell function- limiting gene 
product being regulated in such a manner that the bacterial 
cell when targeted will be killed or limited in its function 
in a pre -determined manner. 



29. A recombinant or mutant bacterial adhesin variant derived 
10 from a naturally occurring parent adhesin, said adhesin 
variant having altered binding properties relative to the 
naturally occurring adhesin from which it is derived, the 
altered binding properties including binding to at least one 
receptor to which the parent adhesin does not bind. 

15 30. An adhesin variant according to claim 29 which is derived 
from a naturally occurring adhesin isolated from a cell 
structure selected from the group consisting of a capsule, a 
lipopolysaccharide layer, an outer membrane protein, a 
flagellum, a pilus, a fimbria, a non-fimbrial adhesin (NFA) 

20 and an af imbrial adhesin (AFA) . 

31. An adhesin variant according to claim 29 or 30 which is a 
protein having an amino acid sequence differing by at least 
one amino acid residue from its parent protein adhesin. 

32. An adhesin variant according to claim 29 which is a FimH 
25 mannose- sensitive adhesin having an amino acid sequence which 

differs from the E. coli PC31 FimH adhesin as defined in 
Table 1 herein by at least one amino acid, said FimH adhesin 
binding to a receptor selected from the group consisting of a 
domain where mannosyl residues are not terminal and a domain 
30 devoid of saccharide. 

33. An adhesin variant according to claim 32 which is at 
least 90% homologous to the PC31 FimH adhesin. 
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34. An adhesin variant according to claim 32 which is a 
chimeric adhesin comprising amino acid sequences from differ- 
ent FimH adhesins. 

35. An adhesin variant according to claim 29 which binds to a 
receptor moiety selected from the group consisting of an 
animal receptor moiety, a plant receptor moiety and an inani- 
mate receptor moiety. 

36. An adhesin variant according to claim 29 which is part of 
a fusion protein comprising the adhesin variant and a 
heterologous polypeptide . 

37 An adhesin variant according to claim 36 wherein the 
heterologous polypeptide is selected from the group consist- 
ing of an epitope, an enzyme, a toxic gene product and an 
antibody. 

38. A FimH adhesin comprising 279 amino acids, having an •' 
amino acid sequence which differs from the E. coll PC31 FimH 
adhesin as defined in Table 1 herein by at least one amino 
acid. 

39. A FimH adhesin according to claim 38 which has an amino 
acid sequence which is selected from the group of sequences 
appearing in Fig. 5 herein with designations CI#12, CI#4, 
CI#7 or CSH-50. 

40. An adhesin according to claim 38 which binds to a 
receptor selected from the group consisting of a domain where 
mannosyl residues are not terminal, a domain devoid of 
saccharide, a glycolipid, a glycoprotein, a protein, a 
polypeptide and a peptide. 



41. An adhesin according to claim 38 which when tested for 
binding to yeast mannan (Mn) , human plasma f ibronectin (Fn) , 
periodate treated Fn and the synthetic peptide FnSpl compris- 
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ing the first 30 amino acids of Fn, only binds to Mn (M class) . 

42. An adhesin according to claim 38 which when tested for 

binding to yeast mannan (Mn) , human plasma fibronectin (Pn) , 

periodate treated Fn and the synthetic peptide FnSpl compris- 

5 ing the first 30 amino acids of Fn, binds to Mn and Fn (MF 
class) . 

43. An adhesin according to claim 3 8 which when tested for 
binding to yeast mannan (Mn) , human plasma fibronectin (Fn) , 
periodate treated Fn and the synthetic peptide FnSpl compris- 
ing the first 30 amino acid residues of Fn, binds to all of 
these (MFP class) . 



10 



44. An adhesin according to claim 38 which when tested for 
binding to five Fn- fragments obtained by thermolysin treat- 
ment, only binds to the 40-kDa gelatin -binding fragment. 

15 45. An adhesin according to claim 38 which when tested for 
binding to five Fn- fragments obtained by thermolysin treat- 
ment, binds to all five fragments. 

46. An adhesin according to claim 38 which is at least 90% 
homologous to the PC31 FimH adhesin. 



20 



25 



47. An adhesin according to claim 38 which is a chimeric 
adhesin comprising amino acid sequences from different FimH 
adhesins. 

48. An adhesin according to claim 38 comprising an amino acid 
sequence which differs from the E. coll PC31 FimH adhesin by 
at least one amino acid occurring between residues 27 and 119 
of the mature FimH sequence. 



,49. An adhesin according to claim 48 which binds to a 
receptor moiety selected from the group consisting of a human 
receptor moiety, an animal receptor moiety and a plant 
30 receptor moiety. 
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50, A recombinant replicon comprising a DNA sequence selected 
from the group consisting of a sequence coding for a recombi- 
nant bacterial adhesin as defined in claim 29 and a sequence 
coding for a FimH adhesin as defined in claim 38. 

5 51. A recombinant replicon according to claim 50 wherein the 
DNA sequence codes for a FimH adhesin having an amino acid 
sequence which differs from the E. coll PC31 FimH adhesin by 
at least one amino acid. 

52. A replicon according to claim 52 in which the DNA 

10 sequence is at least 90% homologous to the PC31 flmH gene. 

53. A replicon according to claim 50 in which the DNA 
sequence is a chimeric flmH gene comprising DNA from differ- 
ent flmH genes . 

54. A replicon according to claim 50 in which the DNA 

15 sequence comprises a DNA sequence which differs from the E. 
coll PC31 flmH gene by at least one codon between the codons 
coding for amino acid residues 27 and 119 of the mature FimH 
sequence . 

55. A replicon according to claim 50 in which the DNA 
20 sequence comprises a further DNA sequence coding for a 

heterologous polypeptide. 

56. A replicon according to claim 55 wherein the polypeptide 
is selected from a group consisting of an epitope, an enzyme, 
a toxic gene product and an antibody. 

57. A replicon according to claim 55 wherein the further DNA 
sequence codes for a gene product which is selected from a 
pesticidally active gene product and a pollutant -degrading 
gene product. 

58. A replicon according to claim 50 wherein the DNA sequence 
30 is isolated from an EnteroJbacteriaceae species. 



25 
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59. A fusion protein comprising an adhesin selected from the 
group consisting of a recombinant bacterial adhesin variant 
as defined in claim 29 and a FimH adhesin as defined in claim 
38, and a heterologous polypeptide. 

5 60. A fusion protein according to claim 59 wherein the 
heterologous polypeptide is selected from an epitope, an 
enzyme, a toxic gene product and an antibody. 

61. A fusion protein according to claim 59 which carries a 
non-covalently bound compound. 

10 62. A bacterial cell which expresses an adhesin selected from 
the group consisting of a recombinant bacterial adhesin 
variant as defined in claim 29 and a FimH adhesin as defined 
in claim 38. 



15 



20 
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63. A recombinant bacterial cell according to claim 62 which 
comprises a recombinant replicon as defined in claim 50. 

64. A bacterial cell according to claim 62 which is selected 
from finterobactex-iaceae, Pseudomonadaceae , Vlbrlonaceae and 
Baccilaceae. 

65 . A bacterial cell according to claim 62 which further 
expresses a gene product selected from the group consisting 
of a pesticidally active compound, an immunologically active 
gene product and a pollutant -degrading active compound. 

66. A bacterial cell according to claim 62 in which the 
recombinant adhesin variant is expressed as a fusion protein 
comprising the adhesin variant and a further polypeptide. 



67. a bacterial cell according to claim 62 which further 
comprises a gene coding for a gene product which, when 
expressed has a killing or cell function- limiting effect in 
said cell, the expression of said gene coding for the cell 
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killing or cell function- limiting gene product being regu- 
lated in such a manner that the bacterial cell when targeted 
to receptor in a specific location will be killed or limited 
in its function in a pre -determined manner. 



5 68. A method of isolating a bacterial cell expressing an 
adhesin having modified binding properties relative to a 
natively expressed adhesin, comprising identifying in the 
bacterial cell DNA sequence (s) coding for the binding 
domain (s) of said natively expressed adhesin and substituting 

10 at least one codon herein, whereby a modified adhesin mole- 
cule is expressed that is different in at least one amino 
acid from the adhesin expressed natively, and selecting a 
bacterial cell expressing the modified adhesin having an 
altered adhesion phenotype relative the natively expressed 

15 bacterial adhesin. 

69. A method according to claim 68 wherein a non-adhesin 
compound is associated with the adhesin. 

70. A method according to claim 69 wherein the non-adhesin 
compound is associated with the adhesin by being expressed 

20 with the adhesin as part of a fusion protein comprising the 
adhesin. 

71. A method according to claim 68 which in a further step 
comprises binding non-covalently a compound to the adhesin 
when expressed. 

25 72. A method according to claim 68 wherein the natively 
expressed adhesin is a PimH adhesin. 

73. A method according to claim 68 wherein the codon (s) 
is/are substituted by mutagenization. 

74. A method of preparing a recombinant bacterial cell that 
JO binds to a specific receptor moiety, comprising introducing 

into a bacterium that does not produce an adhesin binding to 
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said receptor moiety, a DNA sequence coding for an adhesin 
binding to the receptor moiety, and selecting a bacterial 
cell expressing the DNA sequence. 

75. A method according to claim 74 wherein the DNA sequence 
! coding for an adhesin binding to the receptor moiety is a 

sequence coding for a FimH adhesin. 

76. A method according to claim 74 wherein the DNA is intro- 
duced by transforming the bacterial cell with a recombinant 
replicon as defined in claim 50. 

77. A method according to claim 74 wherein a non-adhesin 
compound is associated with the adhesin. 



78. A method according to claim 77 wherein the non-adhesin 
compound is associated with the adhesin by being expressed 
with the adhesin as part of a fusion protein comprising the 

15 adhesin. 

79. A method according to claim 74 which in a further step 
comprises binding non-covalently a compound to the adhesin 
when expressed. 



20 



80. A method of providing a mutant bacterial cell having 
fimbriae which binds to a moiety to which the wild- type cell 
from which the mutant cell is derived does not bind, compris- 
ing contacting a population of said wild- type cell with said 
moiety, removing the contacted cells which do not bind to the 
moiety, cultivating cells binding to the moiety to obtain a 
25 culture which is enriched with regard to cells binding to the 
moiety and selecting from said culture a mutant cell binding 
to said moiety. 



81. a method according to claim 80 wherein the moiety with 
which the wild -type cell population is contacted, is a 
30 casein. 
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82. A method of isolating a compound from a solution or 
suspension containing the compound, the method comprising 
contacting the solution or the suspension with a fusion 
protein according to claim 59 wherein the heterologous 

i polypeptide has an affinity to the compound to be isolated. 

83. A composition comprising a population of a bacterial cell 
as defined in claim 62. 
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